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PREFACE 


Quality Control is not a new statistical theory, but a most 
convenient arrangement of the geometry of statistical dis- 
tributions. So convenient, in fact, that it enables the practical 
production man to handle that “perilous stuff” probability 
in a most efficient manner. He thus becomes the master of 
chance in its manifold influences upon the quality of manu- 
factured product, instead of its victim. 

This feature of being not a new theory but a convenient 
arrangement of the mathematics involved in statistical problems 
Quality Control has in common with a method which is both 
more difficult to handle and more ambitious in its aims, called 
Analysis of Variance, and which has become an important 
tool for ascertaining tire source of variability of industrial 
product. The relation between the two methods, given by the 
fact that Analysis of Variance arranges the arithmetic of statistical 
problems in a way very similar to how Quality Control arranges 
their geometry^ has been stressed throughout tlie relevant por- 
tions of this book, and this has been found to offer an easy 
approach to two methods otlicrwise not easily accessible to 
the general reader : Analysis of Variance and Theory of 
Correlation. 

Concerning the representation of statistical methods, I 
have acted upon the principle that mathematics is a. language 
— ^in symbols instead of in letters — ^which can be translated, 
like any other language, into English, and ought to be if the 
reader is to become familiar with “ the Why and the Where- 
fore,” and not only with the “ How,” of applying statistical 
methods. This book should therefore be looked upon not 
merely as a manual of mechanical rules for the application of 
statistical methods, but as one of statistical thought. If the 
reader learns to look at it in this way, he will find it helpful 
in many other ways of life. ^ 

There is, however, as any practical linguist knows, a hmit 
to “ translatability ” from one language into another. Certain 
expressions in one language have no exact equivalent in the 
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Other. Instead of making the translation too heavy and 
unwieldy by circumscribing such expressions) it is often better 
to use them as “ loan words.” The reader should regard in 
the same way the mathematical expressions — symbols and 
formulae — used in this book) whose amoun t, however, has been 
cut down to a minimum. He should take them “ on loan,” 
and not omit to pay interest on them by a full understanding 
of their meaning. If he has had secondary school education, 
he will find himself amply equipped for following the mathe- 
matical reasoning. In fact, it does not presuppose a thing 
beyond the four fundamental arithmetical operations of adding, 
subtracting, multiplication, and division, and some under- 
standing of graphs. 

This book has grown out of lectures delivered before 
audiences of men and women engaged in production. Since 
to the majority of them the term “ Statistics ” meant nothing 
but the (so it seemed to them) “ dry and tedious ” procedures 
for collecting information about Marriages, Births and Deaths 
for purposes of the Registrar-General, and since they were 
required to translate into practice almost immediately what 
they were taught at these lectures, it seemed imperative for 
obtaining good results to make the subject as palatable as 
could be. 

To rouse their interest and make the matter take firm roots 
in their minds, their attention was drawn to what may be called 
the “ Romance of Chance,” in keeping with Prof. White- 
head’s dictum when criticizing conventional scientific education: 
“ Without the adventure of Romance you get inert knowledge 
and, at its worst, a contempt of ideas.” This line of approach 
has been preserved in the design, execution and style of the 
book. 

A good deal of the difficulties and doubts of the audience 
as they came up during the discussions, and on subsequent 
occasions during shop inspection, has been worked into the 
text. The book has thus become a sort of translation of pro- 
duction and inspection problems into the language of statistics, 
rather than a collection of examples to illustrate the theorems 
of statistics. 

The book is meant to form an introduction to all branches 
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of Quality Control which have been found useful in practical 
work, such as the principal technique of Contwl Charts (Chapters 
II and VI), first developed by Shewhart, Pearson, Budding, and 
^ennett, the technique of Probability Graphs (Chapters III and 
7.8-7.11), two standard methods o[ Sampling Inspection (Chapters 
VII and VIII), due to L. E. Simon and Dodge-Romig, and two 
methods indispensable forjudging experimental results. Analysis 
of Variance (Chapter IV) and Conelatiom (Chapter V).* 

Apart from the references given above and the works cited 
in their appropriate places, indebtedness is expressed to 
authors and publishers for permission to reprint tables and 
diagrams from the following works : 

Davies and Pearson, “ Methods of Estimating from Samples 
the Population Standard Deviation,” J.R.S.S., vol. i. 
No. 1, 1934. 

Croxton and Crowder, Applied Geneml Statistics. London, 
Pitman & Sons, Ltd. 

L. E. Simon, An Engineer's Manual of Statistical Methods. 
New York, John Wiley & Sons ; London, Chapman & 
Hall. 

Dodge and Romig, “ Single Sampling and Double Sampling 
Ins]Dection Tables,” The Bell System Technical Journal, 
published by the American Telephone and Telegraph 
Company, 195 Broadway, New York. 

In a work of this kind it is neither possible nor necessary 
to give exact references for the authorship of each detail of 
theory or procedure. Consequently, I have not thought it 
necessary to stress my own authorship in places where a 
method or some information of theoretical interest was due to 
myself. The reader acquainted with the literature of Quality 
Control will no doubt find several new departures in theory 
and method, but it is before all the welding of the various 
branches of Quality Control into one unified whole, attempted 
here for the first time, which is claimed as original. 

As far as examples are taken from actual experience, they 
have been altered so as to avoid disclosing actual processes but 
riot in a way prejudicial to their illustrative value, 

* Since this was written, Slatislical Quality Control by E, L. Grant has been 
published in U.S.A. 
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NOTE 

J^fumeration of Paragraphs, Figures, and Equations 

Each paragraph in this boot is distinguished by a number 
consisting of the chapter number prefixed to the number of 
the paragraph in the chapter, e.g. 5.9 means the ninth 
paragraph of Chapter V. A similar system of numeration 
is used for tables, figures, and equations, e.g. Fig. 3.2 means 
the second figure in Chapter III, and similarly for tables arid 
equations. 



Chapter I 


LAWS OF CHANGE— THE NORMAL LAW 
OF ERRORS 

I.I The Case Against loo per cent. Inspection 

In industrial work we are always faced with the problem of 

having to judge lot quality from sample quality. 

There seems to be one royal road for judging lot quality, 
and that is by testing everything. This is called 100 per cent, 
inspection. Close examination shows that 100 per cent, inspec- 
tion is not infallible. 

The reasons against it are these ; 

(1) 100 per cent, inspection is seldom done, even when it is 
supposed to be clone. The examiner in charge of it will 
usually test as much as he or she can, and then pass on 
the rest unexamined. 

(2) Even when 100 per cent, inspection is conscientiously done 
one cannot expect it to be without errors. If an examiner 
is required to test, say, 8,000 small articles for a certain 
dimension during one shift, it is inevitable that, sooner or 
later, errors will be committed. Inspection fatigue sets in 
very soon, as experience shows, and makes the inspection 
results more or less unreliable. 

(3) Costs and labour required for 100 per cent, inspection are 
as a rule prohibitive. The use of gauging machines in 
place of human l^eings — ^where the characteristic is measur- 
able — is no satisfactory way out of the difficulty, because 
it has been shown that gauging machines are subject to 
the same types of error as other processes in nature. 

The alternative method for assessing lot quality is to take 
comparatively small samples and judge the quality of the lot 
from the quality of the samples. 

Before the appheation of statistical laws to this kind of 
problem, the lot quality was .simply taken to be the same as 
the sample quahty. In other words, one did not take into 

1 
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account the fact that the sample was only a sample from the 
whole lot, and as such could show not only the same quality 
as the lot but also a better or worse quality. 

I One always had, of course, a vague idea that chance came 
into play whenever lot quality was judged in this way, but 
chance was considered to be of such an elusive character that 
the only thing that could be done about it was to ignore it. 
This was a very unsatisfactory state of affairs because one 
thus became the victim of chance instead of its master, and 
in most cases evidently judged lot quality wrongly, 

I 

1.2 Laws of Chance 

There is, however, no longer any necessity for ignoring 
chance, because to-day a good deal is known about its working. 
In fact, laws of chance have been discovered from which it can 
be clearly seen that chance is not at all the mysterious entity 
which is supposed to suffer events to happen without a sufficient 
cause, but something much more prosaic. We have learned 
to tune chance a good deal flatter than we did before. 

It is interesting to learn how people came to study chance. 
It was in the eighteenth century when the French aristocrats 
were desirous of making fortunes at the card table, or in other 
forms of gambling. In order to make sure of success they 
hired mathematicians to work out the perfect schemes for 
winning at these games of chance. Although it is not known 
whether they were rewarded in their efforts, the findings of 
the mathematicians are recorded, and represent the first books 
on the laws of chance. Later on those studies extended from 
games of chance to almost every branch of human knowledge 
and it was found that what these mathematicians had dis- 
covered were laws of nature, or, at least, mathematical laws. 

1.3 The Normal Law of Errors 

Quality Control is based upon certain law.s of chance, and 
we shall therefore consider in detail what is meant by a Law 
o^ Chance. 

Supposing a great number of shots are fired at a target by 
a fairly good marksman. Let the experiment be repeated a 
number of times, and let the experiment be repeated also by 
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using different marksmen ; then the. scatter of shots on the 
target will always be different from one experiment to the 
next, but in spite of these differences a certain pattern will be 
discernible. The pattern will be that the shots nearest the 
centre of the target will be most frequent, and their density 
will decrease as we move away from the centre in any direction. 



Fig. 1.1 Distribution of Shots on a Target 


Let US further suppose that the target is divided into equal 
strips 1 in. wide. We can then “group ” the observations on 
the target by counting the hits contained in one strip and 
erecting over a horizontal base representing our scale, rect- 
angles, one for each strip interval, whose areas are propor- 
tionate to the number of observations in each strip. 

The resulting picture is called a histogram (Fig. 1.2). If 
the total number of observations is increased indefinitely, so 
that the class intervals can be correspondingly decreased, and if 
the mid-points of the small side of the rectangles are connected 
by means of a continuous curve we arrive at what is called the 
Curve of Errors, or the Normal Law. 


The algebraic expression of the normal curve is 

h 1 . 1/F 

JV = 


fi =- 


e 


yir a-\/2ir 

where x is the deviation of a measurement from the mean, 
h is the measure of precision equalling 1 

c is the standard deviation, (see paragraph 2.1), 
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The integration of this expression between certain limits then gives 
the proportion P of measurements lying within these limits, and is 

known as the Error Function P/ 2 = J ^ 2\ff/ • * 

(see Appendix I) 



This is the basic law of chance used in Qpality Control 
work. 

It is a law of chance for this reason. If somebody were to 
ask the marksmen what made dieir shots deviate from the 
point at which they precisely aimed, they would probably 
answer — barring any stronger language — “ It is just chance.” 
For the single shot there is no prediction possible as to where 
it will strike. One understands here, in a vague way, what 
the meaning of chance is. It is the combined effect of a multi- 
plicity of very small causes, so small and so many that it is 
impossible to disentangle them or enumerate them, and 
consequently to predict the effect they will have on the aiming. 
In the present case, these small causes can be slight differences 
in powder, or in tire dimension of the projectile, differences in 
the atmosphere, differences in position of the marksmen, etc. 

It is a law of chance because, knowing the properties of the 
Qurve of Error, it is possible to predict how many shots would, 
on pure chance, be observed within or ' without a certain 
distance on either side of the centre. 

Let it be noted that it is not the shape of the curve which 

(«X2) 
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matters but its structural properties. The shape of the curve 
may vary, but its fundamental properties remain the same. 
The most outstanding of these properties, which enables one 
to use the curve for prediction of the kind mentioned above, 
is this. If we project the point in the curve where the convex 
changes into the concave on to tire base, the distance of that 
point from the mean is called standard deviation. Now the 
properties of the curve are such that no matter what its shape 
is like, there will always be the same proportion of observations 
within (or without) the same multiple of tire standard deviation 
on either side of the mean. This makes prediction possible, 
and prediction is scientifically conceivable only if the matter 
is governed by a law.* 

It is this Normal Law which, as observed, forms the basic 
concept of Quality Control. It is easy to see why this should 
be so. It has been found to apply everywhere in nature where 
a multiplicity of small causes brings about a certain effect. 
In industrial work it is this multiplicity of small causes which 
makes it difficult to work to what is called Specifications, laid 
down for important characteristics, which must be observed 
if the product is to be acceptable. If we think, for instance, of 
a certain characteristic like the length of an article, then the 
number of small causes entering into the process and deter- 
mining the length dimension are legion. Consequently, the 
concept of chance will enter into our considerations. Now it 
has been found empirically that industrial measurements of 
that kind obey the normal law very nearly. That is, if the 
measurements of, say, length of an article, are taken for a 
considerable number of such articles, and are grouped in the 
way described above, the distribution will follow more or less 
closely the normal law. We are then able to predict how much 
of the bulk of the product will be within certain distances of 
the mean. We arc also able to determine whether the bulk 
range is compatible with the specifications, and if not, to 
ascertain the causes that disturb a stable and, from the point 
of view of design and specification, acceptable production. 

♦ For a clear and concise account of the mathematical ideas underlying the 
conception of the Normal Law, the reader is referred to Levy and Preidel, Elemeri' 
tarj Statistics (Nelson) 

m) 
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1.4 Application of the Normal Law to Gunnery Work 

Quality Control is essentially a method designed to assist 
the production people in their work. As such it is a simplifica- 
tion of statistical methods which it would otherwise be very 
difficult to apply to mass production. 

To ascertain whether or not a distribution follows the 
Normal or any other law, and what is the particular shape of 
such a distribution is a difficult business, and people engaged 
in the production shops cannot be supposed to carry out 
calculations of that kind. A simplification is therefore 
necessary if statistical methods are to be of any use in 
practical work. 

The necessity of transforming the complicated laws of 
chance into mathematical tools which are easy to handle has 
been felt in other branches of human knowledge, for instance, 
in gunnery work. 

The gunnery officer in the field has no time to carry out 
a complicated analysis of the scatter of his shots. On the 
other hand, with our modern weapons it is necessary to do 
an analysis of a certain kind in order to know whether the gun 
is satisfactorily aimed. 

If the gun is aimed at a certain distant point and a 
great number of shots are fired without altering the position 
of the gun, it has been found that the shots do not by any 
means all strike at one point. A scatter results, and the 
distribution of the shots in the main direction, say, of the 
scatter, follows the normal law. The reason is simply that 
a great number of small causes combine in bringing about 
deviations of the single shots from the centre of the 
distribution. 

The distribution is briefly this : most of the shots will be 
around what is called the “ centre of impact,” which may or 
may not be on the target, and as one moves farther away from 
the centre of impact on either side, the number of shots per 
interval will decrease. 

Taking an ideal case we would get the following distribution, 
*^here E represents a distance related to the above-mentioned 
standard deviation (in fact it is -6745 times the standard 
deviation), called the Probable Error. 
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Table 1.1 


Ta T, 
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E, 2E, etc., are 
distances from 

2 

7 

16 

25 

25 

10 

1 7 

2 

centre of impact 


Percentages of Shots falling in each Zone 


Now supposing the gun in actual practice or in warfare is 
aimed at a certain distant object and a great number of shots 
are fired. The ofRcer in charge of the gun wants to know 
whether the gun is correctly aimed and whether most of the 
shots really strike in effective distance of the target. He does 
not see where the shots strike but his observer officer reports 
the number of shots which are seen to strike beyond the target. 
Supposing he reports that 9 per cent, of the shots are seen to 
strike beyond the target (Ti). If the centre of impact was on 
the target, then 60 per cent, should be seen striking beyond and 
60 per cent, short of the target. From the report received the 
gunnery ofRcer knows immediately that the centre of impact 
has to be shifted by two probable errors in order to bring it on 
the target. The probable error for a type of gun, and also 
for the individual gun, is a known quantity, and the gun can 
be re-aimed accordingly. 

To take another example, supposing the observer officer 
had reported that 25 per cent of the shotswere seen to strike short 
of the target (T^) . Then the gunnery officer knows that the 
centre of impact has to be shifted backwards by one probable 
error in order to bring it on the target. 

1.5 The Transition firom “ Probability of Fire ” to “ ProbabUity of 
Good Production” 

The same type of simplification for working with the normal 
law is also used in Quality Control work. Instead of con- 
structing curves and ascertaining their properties, we woA 
with certain distances from the mean, be it the specification 
mean or a calculated one, and ascertain the number of observa- 
tions within and without these distances. 
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One might be inclined to say ; “ We do not see the redson 
why something applied in artillery work should be applicable 
to mass production. After all the firing of a gun is something 
very different from the turning out of articles on a machine.'” 
To counter such an objection, we need only realize that we 
actually have not only a quantitative target in industrial work 
but also a qualitative one, and that what we really aim at is 
to “ hit ” that target. 

The qualitative target for industrial work is the Specification 
Mean, together with the Tolerances on either side of the mean. 
The product is acceptable only if the characteristic in question 
comes out within this target. It is thus actually, in the full 
sense of the term, a question of hitting the target. Furthermore, 
as has been pointed out under 1.3, tire great number of small 
causes responsible for a given characteristic of an industrial 
article behave like any other combination of small causes ; 
they produce a scatter or variation of measurements of that 
characteristic which more or less complies with the Law of 
Error. Seeing that in this respect the “ Probability of Fire,” 
as the gunnery officer calls it, and the “ Probability of Good 
Work” in mass production are governed by the same type of 
law, we must conclude that the production man is simply in 
the position of the gunner, and the production engineer in the 
position of the gunnery officer, and that the machines or pro- 
cesses have to be directed by the help of Laws of Chance in such 
a way as to make the product come out on tire target. Thus it 
is quite natural to apply to the control of mass production a 
simplification which has been found very useful in external 
ballistics. If for instance, we find that the average of the 
dimension in question would be a good distance from the 
Specification Mean, we know that the “ centre of impact ” 
of the process has to be shifted so as to bring it back to the 
specification mean. Or, if we were to find that the scatter is so 
great as to make the number of articles around the specification 
mean small, the Production Engineer has to do everything in his 
power to reduce that scatter so as to make the product come out 
j&operly within the target, thinning out towards its boundaries. 

The statement that “ industrial measurements , . . obey the 
normal law very nearly” (paragraph 1.3) is open to criticism 
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if taken loo literally. It is, in fact, an over-simplification in 
two I’cspects. 

Firstly, when we say that measurements are distributed 
'‘according to a law of chance, this must not be taken to imply 
that there will be no assignable causes responsible, in varying 
degree, for some or all of the measurements of that kind. On 
the contrary, we must be fully alive to the fact that more often 
than n.ot such disturbing causes will be present. It is just 
because of this that we need quality control to weed out these 
undesirable influences. It follows that a pure chance system 
will be the result rather than the starting-pbint of quality 
control work. The chance cause system which is, by 
definition, free from interference by assignable causes is thus 
not “ given,” as a rule, but is what we are aiming at. It is 
now an empirical fact that such a state can be reached by a 
suitable method of quality control. 

Secondly, the form of the distribution of measurements is, 
as a rule, only approximately normal, and not infrequently 
distinctly skew. Flowever, as we shall see in Chapter II, it is not 
the distribution of individual measurements with which we are 
directly concerned in Qiiality Control work, but that of the 
averages or other statistical parameters (characteristic quan- 
tities). Now it is again an empirical fact, supported and 
explained by statistical tlieory, that the distribution of averages 
(which is the most important statistic in this connection) is 
approximately normal even though that of the individual 
observations be very skew. 

1.6 The Place of Quality Control in the Development of Statistical 
Method 

What we call “ statistical method ” had its origin in the 
Theory of Errors. That theory taught us how to find the best 
approximation to the “ true ” value of a given thing, e.g. a 
physical constant, an atonfic weight, the position of a star, etc. 
Only me value could here be the true value, and as the deter- 
minations were found to be always more or less discordaih, 
among themselves, the arithmetic mean was taken as repre- 
senting the best estimate of the “ true ” value, in accordance 
with the Theory of Errors. 
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Population statistics and biological statistics as wc know 
them started when it was found empirically that the varying 
values of a given characteristic, such as the chest measurements 
of a certain male population between, say, 20 and 50 ycarsT 
of age, or the weight of a certain kind of beans, were distributed 
around a mean value similar to the measurements of a single 
thing. 

But notwithstanding the formal likeness of distribution, the 
object is here a very different one. We have not any longer 
one thing which we measure repeatedly in order to ascertain 
its true value, but diffeient individuals (representing probably 
as many “ true ” values) each of which is measured only once, 
and what we are after is not a “ true ” but the “ typical ” 
(mean) value of the characteristic for the given group of 
individuals, and its variabihty. 

No doubt, if we measured an individual repeatedly, we, 
would get somewhat discordant values. But the differences 
arising from such repeated measuring of one and the same 
individual are so small compared with the differences between 
different individuals that they can be neglected, and even 
must be neglected if we are to arrive at the kind of statistical 
information we are after in population and biological statistics. 

Tilhe application of statistical method to industrial measure- 
ments represents the next logical step in the development of 
such methods. The speed with which manufacturing condi- 
tions change compels us to use what may be called “ dynamic 
statistics.” 

Instead of assuming that we are dealing with one chance 
universe * whose statistical structure we determine so as to make 
comparisons possible with related chance universes, we now 
at the outset suspect that disturbing causes are present, or we 
take into account the possibihty, if not probability, that the 
product represents a mixture of different chance universes. 

Consequently, we are now primarily interested in the 
differences, or the distribution, of those values which are 
“typical ” for each of the partial universes, which we suspect 
to have contributed to our total of observations, or of which 
we suspect that total to be compounded. To get the overall 

* 'Lhis is the teim used for the total of observations 
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picture we want, and in order not to miss the forest because 
of the trees, we must now 1o some extent neglect the variation 
between individuals. 

The distributions with which we are, therefore, concerned 
here are distributions of means and standard deviations or 
ranges. They are called the “ Sampling Distributions ” in 
contradistinction to the Frequency or Parent Distributions of 
the Individuals. 



Chapter II 


rr 


THE RAIL LINES OF CHANGE— QUALITY 
CONTROL CHARTS 

z.i Statistical Control of Quality 

Remembering what has been said above (1.3) about industrial 
measurements following closely the Normal Law, let us now 
see in detail what simplification has been devised for working 
with tliis law in industrial practice. We know already that the 
simplification is of exactly the same type as that used in 
Ballistics. The fundamental property of the Normal Curve 
is used according to which certain distances from the mean are 
invariably bound up with certain percentages of the area under 
the curve. If we project the point of the curve where the con- 
cave changes into the convex on to the x-axis, then the distance 
from the mean to this point is called the Standard Deviation, 
because it represents a specially important deviation. It is, 
in fact, so important that it is used as a general measure of 
dispersion. It divides the area under the curve in the ratio 
2 : 1 (inside : outside). Realizing tliat the total area represents 
the total number of our observations (see Histogram, Fig. 1.2) 
we define the standard deviation as the distance from the mean 
for which there is a chance of 2 : 1 that an article picked out 
at random will not exceed it (see Fig. 2.1). 

A similar regularity governs the chances of deviations at 
certain multiples of the standard deviation. At twice the 
standard deviation on both sides of the mean, the ratio of 
articles “within'” (i.c. having deviations equal or smaller than 
two standard deviations) to those “ without ” (with dimensions 
• greater than two standard deviations) is very nearly 19:1. This 
means that of 20 articles picked at random fpom a very great 
number of pieces, 19 would lie within twice the distance of the 
Siandard deviation on either side of the mean. We can also 
express it by saying that if we pick out one article, the chances 
that it will lie within that distance of twice the standard 
deviatioil on either side of the mean, to that of its having a 

IS 
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dimension exceeding these limits are 19 : 1. At three times 
the standard deviation the ratio of articles “ witliin ” (i.e. 
having deviations equal or smaller than three times the standard 
deviation) to those “ without ” (i.e. with dimensions greater 
than three times the standard deviation) is 369 ; 1. 

This means tliat practically all the product will lie between 
plus and minus three times the standard deviatioiij provided 
that the variations of the characteristic measured are due only 
to chance, with no major causes interfering. 

The following diagram represents the relationship between 
distance and frequency in a normal curve, in graphical form.* 



Pig. 2.1 

The Normal Curve. Relationship between Deviation 
and Frequency of Occurrence 
SD represents the standard deviation 
Aj-ea within ± SD to area outside as 2 ; 1 
Area within ± 2SD to area outside as 19 : 1 
Area within + 3SD is area outside as S60 ; 1 

2,2 The 1/40 and i/iooo Control Limits 

For practical work a further simplification has been intro- 
duced. Areas are imagined to be cut off at 1.96 and 3.09 
standard deviations respectively. The advantage of doing so 
is that the chances of finding an article " within ” these dimen- 
sions to those of finding it “ without” are now exactly 19 :,1 
and 500 : 1 respectively. The probability of a measurement 
(lying beyond these limits on either side of the mean is now 

* see Appendix I for the relationship in aritltraeticai form 
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exactly -05 (5 per cent.) and '002 ('2 per cent, or 2 in 1,000), 
whereas at two and three standard deviations the probabilities 
are -046 and -0027 respectively. 

In the diagram 2 . 1 , an area is cut off on either side of the 
mean representing 2*5 per cent, on each side, or 6 per cent, 
on both sides. As the total ai'ea under the curve represents 
the total number of observations, there will be a chance of 
5 in 100 or 1 in 20 or 2 in 40 that an observation will fall 
outside that distance from the mean. As there is an equal 
chance of a point falling outside on either side of the mean, 
we call these lines 1 in 40 limits, meaning that there is a chance 
of 1 in 40 observations that a point will fall outside one of the 
control lines. 

We further cut off the very small part of the area under 
the curve which lies beyond 3-09 X S.D., which means that 
an area of •! per cent, has been cut off on either side, and 
therefore -2 per cent, on both sides together. 

Applying the reasoning as in the preceding paragraph, we 
should expect two observations in 1,000 to fall outside both 
these limits, or one observation in 1,000 outside each of these 
limits. These lines arc therefore called the 1 in 1,000 limits, 
and represent the distances within which practically all of the 
product is expected to lie. 

2.3 How to Arrive at the Control Chart from the Diagram of the 
Normal Curve 

From this diagram we easily arrive at tire quality control 
chart. All we have to do is to rotate it through a right angle 
and, using the horizontal axis as the time scale, scatter the 
observations of each interval in a horizontal direction according 
to the time at which each observation was made. We then 
obtain the diagram shown at the top of page IS. 

The perpendicular lines erected at the specified distances, 
1'96 and 3- 09 standard deviations, from the mean have now 
become two pairs of quality control lines, called the inner and 
the outer limits. Provided that the process is controlled by 
chance and the variations therefore due to nothing else but the 
influence of chance, with no assignable causes disturbing the 
chance variation, we should expect the frequency of observa- 
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tions within a certain range of values to comply with the per- 
centages represented by these lines. That is, we should expect 
approximately 1 in 40 samples to fall outside one of the inner 
lines and practically nothing to fall beyond the outer lines. 


3,4 Control Limits as the Boundaries Between Stability and 
Disturbance 

What is achieved by having control lines of this sort is 
that we have now a kind of gauge for judging regularity or 
consistency of performance. 

If the normal law, or any other law of chance, is adequate 
for the description of a certain frequency distribution, this 
means that the sole causes responsible for the scatter are chance 
causes. Furthermore, as such a complex of causes tends to be 
invariant, that is, to Remain what it is, we are satisfied that the 
scatter produced by such a set of unavoidable causes will not 
change as long as no assignable cause interferes, and we may 
therefore regard the process as stable under the given set of 
conditions. 

It goes without saying that it is essential in mass production 
for us to be able to assess the quality of the product in a reliable 
way. We should be able to depend on the quality varying 
no more than is unavoidable. It would not do to have the 
product one day conforming to a certain level and the next 
day conforming to another level. In other words, we want to 
be reasonably sure that the emergence of an assignable cause 
of variation — ^that is, a cause disturbing the working of chance 
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— should be spotted without delay, with a view to removing 
it quickly. 

Now, it is only by understanding the working of chance in 
bringing about variations that we are able to distinguish 
between unavoidable and avoidable ones. The first class is 
due to the working of chance, and we can never eliminate that 
source of variation. The second class is due to major causes, 
which can be isolated and made responsible for this or that 
disturbance. The range of variation which we have to tolerate 
because it is unavoidable is clearly indicated by the control 
lines. It can be maintained that, without having an idea of 
how far the influence of chance is felt in a certain process, we 
could never distinguish between what is an avoidable and an 
unavoidable variation. 

3.5 The Necessity for Economic Control of Quality as Distinct from 

Statistical Control 

The part of the control system which has been explained 
up to now is that of statistical control. It goes without 
saying, however, that a process may be under good statistical 
control without meeting the requirements of the management 
for a particular kind of process. It might go on steadily from 
day to day within practically the same control limits, indicating 
stability of production, but these limits might not be in con- 
formity with the specifications and the drawing tolerances. 

This points to the necessity for completing the statistical 
control by what may be called economic control. It is not 
enough to watch the regularity of performance from day to 
day ; the product must also meet the specifications, otherwise 
it may be of little use to the management. That it is not 
enough to consider only whether all the product is within the 
control lines, will become clear from what follows. 

We shall also see that our picture of the quality control 
chart has to be modified somewhat if it is to be of real use in 
practical work. 

3.6 Control Lines for Averages (The Principle Explained) 

One might conceivably reason like this since the tolerances 
define the admissible range of bulk quality, they should be 
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made to serve as 1/1000 control limits. The matter is, however, 
not quite so simple. 

The reasons for not regarding the tolerances as control 
■Umits are : 

(1) Such tolerances arc not always compatible with the 
inherent variability of the pi'ocess (see paragraph 2.4) ; 

(2) the distribution of individuals, to which the tolerances 
refer, is less normal, and therefore less controllable than 
that of averages ; 

(3) the established practice in Shop Inspection is to test by 
averages rather than by individual samples. 

In industrial woik it is not only the dimensions of individual 
articles which are recorded — and it is individual articles to 
which the control lines of Fig. 2.2 refer. The inspection pro- 
cedure usually consists in taking a sample of a number of 
articles and calculating the average. The obvious idea is to 
cover a greater part of the process, and also to reduce the 
freak variations which may occur in one or another article. 

Let us be a little more explicit about tliis. In industrial 
mass production we inspect for quality by taking not samples 
of 1 but of 5 to 10 if we measure, and of 20 to 60 if we gauge. 
The reason is obvious. If the rate of production is high — 
several thousand articles per hour, say — it is evident that a 
sample of 1 will not give an adequate picture of the quality of 
a batch of, say, 200 if taken every 10 minutes or so. We may 
hit upon an accidentally bad article which is in no way repre- 
sentative of the whole batch. If we asked for machine 
adjustment on the strength of that sample of 1, it would 
simply mean asking for trouble, and we would get it. 

Conversely, one satisfactory article might occur in a run of 
bad work, and should therefore not be taken as an indication 
that the batch from which it came was all right. 

Working with averages instead of with individual observa- 
tions tends to give a better foundation for either asking or not 
asking for machine adjustment. Taking samples of 10, say^ 
it is clear that the contribution due to the accidentally bad 
article Would in the average be swamped by the contributions 
due to the 9 others. Conversely, the contribution of an acci- 
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dentally good aiticle would be swamped by the contributions 
of the remaining 9 bad articles. 

But now an important question arises : We know how 
much deviation from the specification mean to tolerate in tht 
individual observation. The tolerance on either side of the 
mean tells us that. But how much are we to allow for the 
deviations in the average from the specification mean and yet 
be satisfied that the bulk of the product will come within the 
tolerances ? What is wanted is evidently control lines for 
averages. 

Now it is fortunate that long before anybody thought of 
applying statistical methods to industrial control, mathe- 
maticians had been successful in finding out by how much 
the averages of comparatively small samples picked at random 
from a great bulk of observations (called the “ population ”) 
are likely to vary among themselves, and also from the bulk 
mean or “ true ” mean. The information thus made available 
about the variation of averages, which was called the Sampling 
Distribution to distinguish it from the Frequency Distribution 
of individuals, came as a godsend when the above problem 
arose in mass production, viz. how far shall averages be allowed 
to vary if the product is still to be considered satisfactory ? 

Before we give a more detailed account of these matters, 
we can show in a graphical way that specification tolerances 
are not suitable for controlling averages. 

Let us assume that, as shown in Fig. 2.3 (page 19), 
averages are plotted on a chart having the specification toler- 
ances as control lines, and that the averages come right up to 
the upper limit. As each point represeirts an average of, say, 
6, the individual observations will be distributed around it, as 
indicated by the small circles. This shows clearly that in- 
dividual observations may fall outside the-specification, although 
the average lies within. 

As the bulk of the product consists of individual observations 
it follows that in order to safeguard against part of the product 
being outside the specifications, we want limit lines well within 
the specification limits. How much they should lie within, in 
order to fulfil that function, will be shown in the following 
, paragraphs. 
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Lower Specific. Tolerance 

Time Scale 
Fio. 2,3 

Specification Tolerances and Sample Averages 
Crosses denote sample averages 
Dots denote individual measurements 


2.7 The Relation Between the Distribution of Individuals and that of 
Averages (Quantitatively Explained) 

Let us assume that for a certain measurable characteristic 
of a certain type of article, the sjaecification value is given 
with a tolerance on either side. Let us call the specification 
value m and the tolerances and and let these latter dis- 
tances be placed symmetrically around the mean. Utilizing 
the information derived from experience, and used so widely in 
quality control, viz. that industrial measurements are, on the 
whole, distributed normally or very nearly so, we can assume 
that if the product was to meet the specifications it should be 
normally distributed around the specification mean witli the 
tails of the distribution approacliing the horizontal just at 
the distance of the upper and lower tolerances. In diagram- 
matic form it would appear thus : 




20 Q_UALITY CONTROL 

Remembering what was said above about the bulk of 
normally distributed articles lying within three times the 
standard deviation approximately, on either side of the Mean, 
our assumption becomes equivalent to saying that the standaftl 
deviation (commonly written o-) in a bulk of articles which 
meets specifications fully, is approximately -J- of the distance 
— ti. In symbols 



I'QGctj) on either side of the Mean (M) will then give the 1/40 
limit, and 3'09crj, the I/IOOO limit for the individual articles 
in the bulk. 

As has been observed above, in industrial inspection we 
work as a rule with averages. That is, we measure a number 
of articles, calculate the average, plot the averages obtained in 
regular intervals on a chart, and watch their distribution. The 
fundamental rule is that the averages should fall within the 
control lines in the assigned proportions. Practically no point 
should fall outside the 1 in 1,000 limit. 

If, however, we want to use the control limits for controlling 
the distribution of averages, the diagram given above (Fig. 2.4) 
has to be. modified. This diagram represents control limits for 
individual observations only — that is, for the observations 
forming the bulk of a product. What is wanted is control 
limits for averages. As we have already seen, in paragraph 
2.6, the distribution of averages differs somewhat from that of 
the individuals. We now want the general rule for transforming 
the control lines for individuals into those for averages. 
Fortunately, the relation is a very simple one, and one of the 
most important to remember in quality control work. 

As regards the form of distribution of averages, it happens 
to be a normal distribution or nearly so, even if the distribution 
of the bulk is far from normal. We can therefore apply with 
confidence the relation between distance and frequency given 
for the normal distribution by the Law of Errors. 

The difference between the distribution of averages and that 
of individuals lies, therefore,' not in the law which governs these 
distributions, but in something else. The distribution of 



21 


THE RAIL LINES OF CHANGE 

averages is closer packed around the mean, representing a 
curve like the narrow curve in Fig. 2.5, and its standard 
deviation is reduced compared with that of the distribution of 
irinividuals in the ratio 1/ where n is the sample size. 



Frequency Distribution and Distribution of Averages 


The quantity is called the Standard Error of the Mean. 
Vn 

Considering that it is nothing else but the standard deviation 
of the frequency distribution of averages or means, it is clear 

that two times that standard deviation or and three times 

Vri 

that standard deviation or — will represent approximately the 

Vn 

distance from the mean of the 1 /40 and 1 /lOOO limits respectively, 
for sample averages. 

It follows that if we write for the standard deviation of 
averages of samples of n coming from a bulk which conforms 
fully with the specificadons, the expression (from 2.1) 




Vn 




( 2 . 2 ) 


then l‘96cr^ will give the 1/40 and the 1/000 limit fox 

averages to be placed symmetrically on both sides of the 
specification mean. 


2.8 How Control Limits are Obtained from Actual Samples ^ 

Up to now we have been only concerned with the specifica- 
tion built, that is, with control lines for 100 per cent, perfect 
product. We must now turn our attention to the actual quality 
( 612 ) 
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of the product which is to be ascertained from periodically taken 
samples. 

Let us assume that actual samples were taken over a con- 
siderable period of time. For each of the samples we then 
calculate the average, and from all these averages what is called 
the grand average X, which represents our mean line. The 
control lines may be calculated in different ways. The direct 
way would be to calculate the average standard deviation 
within samples, s,^ (not that for the total distribution which 
possibly includes the effect of variation between samples), 
transform it into the population standard deviation, o-, by divid- 
ing it by a factor, listed for various sample sizes in Appendix 
III (a) and derive the control lines as the 1/40 and 1/1000 limits 

... l-96cr , 3‘09o- 
lor averages, using the expressions — =- and — =- respec- 

■Vn Vn 

tively.* The simplest way, however, is to obtain them from 
the range. The range is the difference between the greatest 
and smallest dimensions in one sample of n specimens. From a 
number of sample ranges, say 50, the average range is calcu- 
lated and then multiplied by a factor (A'.o 26 ) given in the 
table. Appendix IV, to obtain the inner, and by another factor 
(A'-ooi) fo obtain the outer, limits. For samples of 6, for instance, 
these factors are -377 and -694 respectively. These are the 
distances to be set out on both sides of the average X to give 
the control limits for the averages. 

Mnmerieal example for obtaining control limits. (To be set off 
from Grand Averages in the i directions.) 


Sample 

No. 

Sample Qualities 

Average of five 
Specimens 

Ranges 

(1) 

5 

3 

2 

4 6 

4-0 


1 

4 

(2) 

6 

7 

5 

4 S 

e-4 


3 

(3) 

3 

6 

3 

4 5 

4’2 


3 

(4) . 

8 

3 

7 

5 4 

6-4 


5 

(5) 

6 

2 

G 

4 6 

4-6 


4 





Grand Average, X 4‘72 Av. Range, 

w 3-8 


* see Appendix II for the calculation of Mean and Standard Deviation 
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(a) Control limits obtained from average range^ 

1/40 Control Limits : x A'„, 

For our example : 3'8 x -377 = 1'43 

1/1000 Control Limits : x A'„j, 

For our example ; 3-8 x -694 = 2'26 


Sample 

No. 

Differences from respective 
sample means 

Sums of 
squared 
deviations 

Standard deviations 
(i) obtained as square 
roots of mean sums 
of squares 

w 

11222 

10-0 

1-41 

(2) 

■6 1-6 -4 1-4 -4 

6-2 

1-02 

(3) 

1-2 1-8 1-2 -2 -8 


1-16 

(4) 

2 'C 2'4 1-1 -4 1-4 

17-2 

1-96 

(6) 

1-4 2-6 1-4 -0 -4 

11-2 

1-50 




s . = 0-05 : 6 = 1-39 


[b) Control Limits obtained from average standard deviation 

■within samples. 

1/40 Control Limits : Jg X A .„5 

For our example : 1-39 X 1‘04 = 1-45 
1/1000 Control Limits : jg X A,„j 

For our example ; 1-39 x 1-64 = 2-28 

(The differences in tlie second decimal place are due to the 
small number of observations and to the range function apptying 
strictly only to normal universes.) 

The factors by which to multiply the range are calculated 
by utilizing the above-mentioned relation between the dis- 
tribution of individuals and that of averages. 

The way in which they are obtained can be explained 
briefly as follows. 

We know how to arrive at the standard deviation of the 
bulk from the sample range, because the function which connects 
these quantities is known. We merely divide the average sample 
range zo by a certain factor, d„, which is listed in the table, 
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Appendix III, and which for samples of 5 is 2-326. We further 
know that 3-09 times the standard deviation will give half the 

, ,, , , r 3'09»„ 

range of the individuals in the bulk, and theretore ^ yjjfd 

or for samples of 6, will give half the range of 

the averages. Now it is just these values of 3-09 divided by the 
dn X V^. which are listed in Appendix IV for the 1 in 1000 limits, 
and a similar deduction applies to the 1 in 40 limits. 
These are the “ factors ” by which to multiply the average 
range «)„. If instead of the range the standard deviation is 
calculated for each sample of n, the A-factors of Appendix IV 
should be used for transforming the average standard devia- 
tion in samples of into the distances of the control lines. 

It is emphasized that the limits should be recalculated 
from time to time, say every SO samples. 

2.9 The Relation Between the Variation in the Bulk of Articles 
Produced and the Control Limits Obtained from Samples— The 
Rail Lines of Chance 

We now understand how the actual control limits for 
averages are arrived at, and have now to decide whether these 
control limits comply witir the specifications. They will do so 
if they are in such a position that if aU averages fall within these 
limits, the total of the bulk will be within the specifications. 

This will be the case if they more or less coincide with the 
1(40 and 1/1000 limits respectively for sample averages fi-om a 

100 per cent, perfect bulk (see paragraph 2,7).^ 

It is recommended that these theoretical limits, which are 

t it t ^ 11 . ^ 

approximately obtained as i i should 

be put together with the specification mean into the margin 
of the chart, and that they should be compared with the actual 
limits obtained from the samples as regards absolute distance 

and position. . 

Conversely, if we wanted to arrive at the true bulk limits 
from the control, limits, all we have to do is to multiply the 
distance between the upper and lower 1 in 1000 limits by 
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the square root of the sample size, and to compare that distance 
with the specification. The distance should not be greater than 
that provided by the specification tolerances, and the position 
cf that distance should approximately be that required by the 
specification — ^that is, symmetrically around the specification 
Mean. 

Furthermore, assuming a bulk of perfect product — that is, 
product within the specification — ^we know that the standard 


deviation in such a distribution is approximately 


tu- h 

6 ■ 


We 


also know that, in order to arrive from the standard deviation 
at the average range of samples of n in such a bulk, we have 
to multiply it by a factor du given in Appendix III. Doing 
this, we arrive at a figure for the average sample range in such 
perfect product. This affords a sort of gauge value for the actu al 
average range, which must not be greater than that gauge value 
if the product is to be within specifications. For samples of 4, 

say, we have u54 = 0 ' x 2-06, or — - X 2-06 = -SS — t,) 

bT8 

(6T8 is the value which corresponds exactly to the 1/1000 
probability level). 

As a routine test we should, as has been shown in paragraph 
2.7, arrive at what may be called ideal control lines by calcu- 
lating them from o-^, the standard deviation of averages from 
the perfect bulk, put them as dotted lines on the chart or in the 
margin, and compare their position with the control lines 
actually obtained. . If the product is to be satisfactory, then the 
actual control fines should be within the band indicated by the 
dotted fines, and the proportion of points inside and outside 
the 1/40 and 1/1000 limits in conformity with the denomination 
of these fines. 

All these ways of satisfying ourselves as to the quality of the 
product are only different expressions of one and the same re- 
lationship, viz. the relationship between the standard deviation 
in the frequency distribution of individuals and in that of 
averages. « 

Only if the actual control limits comply with these require- 
ments, with no more than approximately 1 in 40 and 1 in 1000 
averages falling outside the corresponding limits, are we justified 
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in saying that the process is not only stable, but the product also 
within specification, and only then will the control lines serve 
as safe “ Rails of Chance.” 

They may be looted upon as “ Rails of Chance,” becausS, 
separating for a given process the area of averages which is 
compatible with the specifications from the “ forbidden ” area 
for such averages, they define hereby the “ railway ” for the 
process to move in. They tend to make the control inspection 
independent of the chance fluctuations of individual measure- 
ments by eliminating the effect upon the judgment of the 
examiner, of the pebbles, cobble-stones, and even stumbling 
bloeks of freak deviations, and thus enable the process to run 
smoothly and uniformly as if “ on rails.” They are guiding 
lines for inspection and, thereby, for tire process itself, points 
falling outside the control limits indicating the need for action. 

2.10 Assignable Causes Versus Chance Causes — ^Examples of Charts 

We are now in a position to distinguish between two kinds 
of assignable causes — statistical and economical. These two 
causes are not absolutely different from one another, although 
they represent different concepts. A statistically assignable 
cause may also be an economic assignable cause, but need not, 
and vice versa. We can, however, think bf configurations on 
the charts which would allow us to distinguish clearly between 
these two types of causes. 

To fix our ideas let us think of the manufacture of cartridges. 
Into the outer base of the cartridge case the circular “ cap ” 
is inserted containing priming composition. The correct depth 
of insertion is of very great importance for the functioning of 
the carti'idge. Let us assume that the specifications for that 
characteristic demand that, measured from the base of the 
case, the cap should not be neaj’er than 9 thousandths of an 
inch and not farther than 16 thousandtlis of an inch. We can 
express tins by saying that the specifications are -012 in ± -003 
in. From the total admissible range of *006 in. the 1/1000 
,fontrol limits for samples of 6 from the specification bulk 
were obtained according to paragraph 2.7 and put into the 
margin of the chart as guiding lines in Figs. 2.6 to 2.14. 
Each dot on the diagrams represents an average of 6 observed 
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depths of intrusion, the samples of 5 being taken in intervals of 
one quarter of an hour. 

In the following diagrams the 1/1000 limits are represented 
by fuU, the 1/40 limits by broken lines. 

(fl) Statistical and Economic Control . — The following chart is an 
example of a process under control, statistical as well as 
economical, because not only are there no points beyond the 
1/1000 limit, but the position of these limits satisfies the gauge 
lines in the margin of the chart. 


yiOOO Limits 
for 

100^ 12- 

perfect 
product 
11 - 


Fig. 2.6 

Control Chart Picture of Process which is under Statistical 
and Economic Control 

(The change from dots to crosses or circles in this and the following diagrams 
indicates a change in shift.) 


(i) Lack of Statistical Control between Samples of n Items . — The 
following chart shows lack of statistical control, because a 
considerable number of averages fall beyond the control limits. 
Be it observed that the variability of the product within samples 
which is epitomized by the distance between the 1/1000 limits 
is not far from what it should be according to the gauge limits 
in the mai'gin. 


/^ 3 - 
'/lOOO Limits 
for 

100°^ 12- 


perfect 

product 




Fig. 2.7 

Control Chart for Averages of Process lacking Statistical Control 

(c) Lack of Statistical Control between Sub-groups . — Let us 
Suppose that the control chart has been for some time in 
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operation, and that the control 
limits were recalculated at certain 
specified intervals, for instance, at 
the beginning of each shift or wiih 
each new batch of components put 
into production, and let us further 
suppose that the control chart 
represents a picture as in Fig. 2.8, 
with the control limits varying fi-om 
one of these specified intervals to 
the next. 

We would then conclude that 
there was a lack of stability, and 
so infer the working of assignable 
statistical causes, taking the product 
as a whole, and we could also 
discover the type of assignable 
cause. If the intervals were shift 
intervals we would infer that the 
operation differed from one team 
of workers to another. If, on the 
other hand, the intervals repre- 
sented different batches of compo- 
nents, we would infer from a chart 
like that above that there was a 
considerable variation in the quality 
of components. But it should be 
noted that if the control limits were 
all in a position as required by the 
1/1000 lines derived firom the 
tolerances (the gauge lines), then 
the statistical causes of disturbance 
would not be economically disturb- 
ing causes. 

(flf) Lack of Economic Control due 
to Faulty Setting. — The chart, after 
having been for some time in opera- 
tion, may have this appearance : 
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Upper 
l/IOOO^ 
Llm(t 
for 100^ 
** perfect 
produce 



12 — Fig. 2.9 

Control Chart for Averages showing Fault in the Setting 


Here we see that the control limits, although stable through- 
out and therefore not admitting the inference of a statistically 
disturbing cause, are yet not in a position which would promise 
an economically desirable product. Instead of being placed 
symmetrically around the mean (12) they are on one side 
(upper side) so near the specification that part of the bulk 
of the product must be expected to lie outside the specifications. 

There is therefore an economic assignable cause present, 
and apparently the fault lies with the setting. The machine is 
set on the high side. 

(e) Lack of Economic Control due to Increased Variability. — An- 
other assignable economic cause of disturbance can be inferred 
from a diagram like this ; 



Control Chart for Averages showing a Process which is under Statistical but not 
under Economic Control 


Whereas in Fig. 2.9 the absolute distance between the 
control limits was considerably less than the specification range, 
indicating that it was not the variability of the machine but 
only the setting which was at fault, we find in this diagram 
that the distance between the control limits is so wide as to;, 
bring these limits right up to the specifications. From what 
was said above, this means that a considerable part of the bulk 
must be expected to lie outside the specifications, because, as 
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will be remembered, the distance between the control limits 
should be reduced against the specification tolerance in the 
ratio 1 divided by y/n. 

Translating this diagram into physical terms, it means tlmt 
the variability of the machine is too great, which may result 
from tools in the machine being worn or parts having got loose. 
In order to verify conclusions like those mentioned in this para- 
graph, which were arrived at by mere inspection of the chart, 
we have to apply the methods given in the preceding paragraph. 

For instance, the ideal control limits derived from the 
tolerances which we put into the margin of the chart, together 
with the specification mean, enable us to check the quality of 
the product at a glance, not only as regards its general level 
but also as regards its variability. For more precise informa- 
tion we would apply one of the numerical methods. 

2. II Modified Technique of Control* 

In this paragraph useful modifications of the Control 
Chart method will be outlined for the extreme cases where 
the variability of the product was 

(1) greater than admissible according to specifications, and 
could by no engineering or other means be further re- 
duced ; or 

(2) smaller than required by specifications. 

(1) Let us assume that all the efibrts of the production man 
to reduce the range of manufactured product with regard to a 
certain characteristic have been of no avail. Simply to abandon 
the Control Chart method is obviously no real way out of the 
difficulty, because this would not by itself improve the quality 
of the product. What we must try in such a case is to make 
control feasible and useful by either changing the specifications 
or the method of control. 

If it was essential that the characteristic should be con- 
trolled around the specification mean, both tolerances being of 
^equal importance, ■ then there is no other way of bringing 
actual production and specifications into line with one another 
*see First Guide qf Quality Control for Engimm, Ministry of Supply (H.M.S.O., 
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than the reconsideration of the latter, and a suggestion to that 
effect should be forwarded to the management. 

If on, the other hand, it was not the mean but only one of 
tlBe tolerances which was of vital importance in order that the 
product should be usable, then the following modification of 
the control method is recommended. 

It can be used, for instance, with success in the process 
of cap insertion if, whether for reasons in the components or 
for engineering reasons, the sample range became incompatible 
with the specifications. Considering that insufficient depth of 
insertion is a rectifiable fault, whereas excessive depth is an in- 
curable one, the upper tolerance is regarded as the “ vital ” 
one in the sense of the method now to be explained. 


Let us keep in mind ; 

(a) that if practically no part of the product must fall 
beyond the vital tolerance, the mean of the distribution of 
measurements must lie — ^normality assumed — at a distance 

of 3-09 ct or 3-09 i w from that tolerance. The value of our 

un ^ 

variable obtained in this way we regard as the mean, X, around 
which we now try to control the characteristic. 


(6) that if the 1/1000 limits are obtained in the usual way, 
by measuring the distance A'a}„ from that Mean, X, they 
represent the allowable range of sample means if practically 
no individual measurements were to fall beyond the vital 
tolerance. These, then, are the control lines by which to work. 

We can express this in a somewhat different way. If, no 
matter how great the average range, in our samples of n was, 
we wanted to exclude the possibility of individuals falling out- 
side the vital tolerance, be it the upper or the lower, then the 
1/1000 limit must not come nearer to our tolerance than the 

j. Bulk Range — Range of Means . ,1 o nn ^ — 

distance 2 — 2 m symbols, 3’09 w 

2 a„ 


- a: 


OOl'^n) 


or w„(3‘09^ A .floi) 


because even the extreme- 


outliers of the distribution of sample means, lying close to the 
1/1000 limit, do not contain individuals more than 3-09or or 



32 QUALITY CONTROL 

S'OqI »„ distant from the mean of the parent distribution 

dn 

(normality of the distribution assumed), wlrich deviation would 
exceed the vital tolerance. # 

The factors (3-09 ^ — A'.o^]) denoted by the letter M, by 
dn 

which we multiply the average range, w, in order to obtain the 
required distance from the vital tolerance, are listed in a little 
table at the end of this paragraph for samples from 2 to 10. 

Adopting this version of the control method, we measure 
the quantities so obtained as distances from the important 
tolerance (the upper, in our case), inwards, wlrich gives our 
theoretical 1/1000 control line. As that limit is always A! 
distant from the distribution mean, X, "we obtain the latter 
by measuring A'.Q(|,a/„ from the control limit inwards, that is, 
towards the other tolerance. 

This modification will be found extremely useful under the 
circumstances described above, provided any change in the 
variability of the product is immediately taken account of by 
recalculation of the 1/1000 limits and the worldng mean. 


I /IOOO 


I3'0— 


12-i— 


n o 

spec mean 


ll-S — 


•U-O— 

l /lOOO 

ip-5- 


lO'O-. 


9-S~ 


,^UPP£S VlOOO LIMIT CALCULATED PROM OBSERVATIONS'-. 


>V.^EORETICAL UPPER ■ 
VlOOO LIMIT * 


, + 'Ao 


< OBSERVED AVERAGE o 


PREOlCTeO AVERAGE 



“'Ao 


THEORETICAL LOV/ER_,.R»^ OBSERVED '/lOOO 

'/lOOO LIMIT 

Fio. 2.11 


Control Chart of Trocess which has to be controlled with regard to the Upper 
Tolerance and around a Working Mean (as different from the Specification Mean) . 
Observe how cloSely the average and 1/1000 limits calculated from the observa- 
tions approach the theoretical average and control lines 
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(2) Where the variability of the product is considerably 
smaller than required by specification tolerances, the control 
chart will often show a fluctuating mean. This, however, 
sh^iuld not be taken as a peculiarity of the distribution, which 
is better than it need be. But it is only in a case like this that 
we are in a position to make allowance for a fluctuating mean 
without employing too elaborate methods. A case like this 
arises in the process we have chosen as an illustration, if both 
the quality of the components and the precision of the machine 
are above average. 

1 4^0 

'/lOOO 

130 


I2'5 


12-0 

Mean 


ll-S 

11^ 

'/lOOO 

ID'S 

Control Chart for Averages .showing Economic Control in spite of lack of Statistical 
Control (Fluctuating Average) 



Fig. 2,12 


According to the strict letter of instruction, the quality 
controller would have to advisq adjustment of the machine or 
alteration of the operation whenever a point falls outside the 
1/1000 limits, which happens fairly often in such a case if the 
limits are obtained fi:om a great number of samples. He 
would then probably meet with the objection on the part of 
production or those servicing the machine that there was no 
need for correction as the product was within specifications. 
Looking at the control lines in the’ margin of the chart, we 
could not help admitting that the distribution of plotted points 
did not exceed these limits for 100 per cent, perfect product 
in spite of the instability of the process. 

It is thus obvious that the 1/1000 control limits obtained 
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from an average range, w, which was better than the specifica- 
tions require it to be, would tend to make the control of the 
characteristic stricter than it need be, beeause it does not 
tolerate instability even though it was not harmful to 'die 
quality of the product. 

In order to avoid such unnecessary hardship, we use a 
somewhat similar device as in case (1). There we removed the 
1/1000 limit a suitable distance from the tolerance, and here 
we shall make it approach the tolerances as much as is com- 
patible with bringing practically none of the product outside 
specifications. 

We can allow for a shifting mean by working to the extreme 
1/1000 limits that could result from the observed variability 
of the product. Such limits arc obtained by measuring from 
each of the tolerances inwards (towards the centre of the dis- 
tribution) the distance w„ (3-09 i— A'. qoi) or w„M, because, 

assuming a constant variability smaller than required by speci- 
fications, no sample of n whose average falls within these limits 
will contain individual measurements outside specifications. 
These limits are simply tire 1/1000 limits of the partial dis- 
tributions around the extreme mean values wliich can be 
tolerated if practically nothing is to lie outside specifications. 

Here, too, the warning must be added that the method will 


EXTREME POSITION FOR UPPER '/’lOOO 


CALCULATED '/lOOO 


'M£AN 


CALCULATED 


'/lOOO 


EXTRF-ME POSITION FOR LOWER '/lOOO 

Fjg. 2.13 

Gonttol Chart for Averages with 1 /IQOO limits set in from Specification Tolerances 
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be found useful only if the variability of the product is checked 
at suitable intervals, and the position of the limits recalculated 
should changes in that variability occur. 

Table 2.1 

Factors M by which to multiply the average range, in 
order to obtain the distance nearer than which the 1/1000 limit 
is not to approach a tolerance if the product is still to be within 
specifications (to be applied only if the precision of the process 
is higher than required or implied by the specifications, and (para. 

2.11 (1)) where the process is controlled around an artificial or 
working mean shifted so as to suit the excessive variability of the 
process). 


4 

■76 


-73 


6 

•72 


7 

•71 


No. in Sample (h) 2 3 

Factor (M„) -80 -77 

(Factors are calculated according to the formula : 

^ Bulk Range — ^ Range of Sample Means 

Wii 

- /' 3 - 09 1 ^.. 3-091 


= (- 


j.ia Trends 


d„ 

3-09 ,, 




I)/ 


Wn 


8 

•70 


9 

•09 


10 

-09 


Before we leave the subject of charts of averages, there is 
one more point to be discussed, and that is trends of plotted 
points on the chart indicative of certain undesirable conditions 
of the process. 

The basic rule for quality control, that the plotted points 
should be within the control limits, and that the control limits 
should be in a position satisfying the specifications, needs to 
be completed by certain minor rules which will prove helpful 
for spotting trouble in time. 

A chart may present a picture like this : 
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In tills chart none of the points plotted falls outside the 
control limits which, let us suppose, arc in a satisfactory position 
with regard to specifications, but in the left half we notice a 
definite trend of the points towards the upper limit. A pictilre 
like this indicates that the processs has lost stability, and that 
the product will be unsatisfactory if this trend is allowed to go 
on. If such a trend is observed, the machine must be stopped 
and the cause of the trouble investigated. As a rule it will be 
a variable cause which is correlated with the characteristic 
measured. Such a correlation disturbs the normal variability 
of the product and should not be tolerated. 

In the right half we see that the product, although within 
the control limits, has all piled up towards the upper limit. 
As the upper limit is, according to our assumption, in a satis- 
factory position, this does not mean that part of the product 
will be out of specifications, but it does indicate that it will be 
all on the high side of specifications, and such components have 
been found unsatisfactory in a subsequent assembling process. 
One should aim at a symmetrical distribution of the points 
around the mean, thinning out towards the control lines. 
The discovery of trends is made easier by working with averages 
instead of with individual measurements. Fluctuations within 
samples are averaged out in this way, and the real differences 
between samples brought out more clearly. The subject of 
“ Trends ” will be fully dealt with in Chapter V. 

2.13 Range Charts 

From what has been said hitherto, the importance of the 
range has doubtless emerged. As so much depends on the range 
of the product, and as indeed all the control lines on the average 
chart are based upon the range, it would seem desirable to 
control the range independently for its own sake. This is 
actually the case, as range charts are usually run in conjunction 
with average charts. The range chart then provides indepen- 
dent information about the process. Whereas the chart of 
averages provides primarily, apd taken by itself, information 
as to whether the level of the product was in control, the range 
chart enables us to judge whether the variability itself was under 
control. It should be remembered, however, that the range 
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enters into the construction of the control lines for averages. 
These, as we have seen, are obtained from the average range 
of samples of k, and their distance and position give valuable 
inlbrmation about the spread of measure- 
ments. 

Control limits 1 in 40 and 1 in 1000 
are calculated for the range charts accord- 
ing to a method analagous to that 
described for the chart of averages. They 
are obtained, like the control limits for 
the average chart, from the average 
range by multiplying that quantity by 
certain factors given in the table, Ap- 
pendix IV.* The statistical meaning of 
these limits is the same as that of the 
control limits on the average chart, that 
is, they represent the 1/40 and 1/1000 
limits for sample ranges of n specimens. 

As the control limits for the range 
vary with the sample quality, viz. are 
pushed up by bad and lowered by good 
quality, it is essential to consider not 
only the question of statistical control 
of ranges, but also of economic control. 

A product which keeps well within the 
1/1000 limit for ranges may be economi- 
cally undesirable and disqualified as 
“ highly defective ” if those limits have 
been obtained from highly defective 
samples. 

To have guidance limits in the margin 
of the chart whose position can be com- 
pared with that of the limits resulting 
from the observations is therefore just 
as advisable as it was for the chart of ~ ® 

averages, if not more so. Such limits are obtained in quite a 
simple way. We know that the distance from the mean of the 

* Factors for obtaining such limits firom the Average Standard Deviation 
in Samples of n (i„) are given in the left half of the table 
(CIS) 
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Fig. 2.15 Control Chart for Ranges 
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1/1000 limits for averages from the specification bulk (let us 
call it D) is given by 

Therefore ■ rZ)„ = -1^"-, where D„ = \ 3 

A .QQi 

This, then, is the average range in samples of n which is 
required if the product is to comply with the specified level. 
Multiplying this cjuantity, £t)„, by the proper factors listed in 
the table, Appendix IV (b), D'-boo, we obtain the 1/1000 limit 
for ranges in samples of n which is compatible with the speci- 
fications. 

2.14 Control Method if the Range is Specified Independently 

In general we thus derive our estimate of the correct range, 
(and the limits for range) from the specification tolerances for the 
level of die dimension in question. That is to say, our theoretical 
limits for the range are obtained from a statistically derived 
average range in samples of n. 

There is, however, a type of specification which in addition 
to specifying the required level of a dimension also specifies 
the maximum range in a given number of determinations. In 
such cases it is not only possible but necessary to base our 
estimate of the required average range in small samples on 
that maximum range allowed by the specifications. This 
estimate is then used for arriving at the theoretical 1/1000 
limits for averages. We have thus the reverse of the usual 
method. 

This involves using a system of control similar to that out- 
lined in paragraph 2.11, point 2, for arriving at the theoretical 
limits for 100 per cent, perfect product. 

Specifications may be of the following type : a range is 
given .within which a number of determinations made on one 
and the same individual article for a given characteristic are 
'’allowed to vary. At the same time it is stated that the highest 
and lowest determination are not to differ by more than a 
specified amount. 
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As an illustrationj let us assume that a paralleling lathe 
operation is carried out on brass discs or rings (cylinders of 
short height), and the specifications are that the thickness of 
thp disc or ring, measured at any point along the periphery — 
that is, tlie distance between the upper and the lower surface — ■ 
can be anything from -406 in. to *415 in., but with the proviso 
that the maximum and minimum determination on one 
individual article is not to differ by more than -004 in. Let us 
further assume that the number of determinations to be taken 
on each article is fixed at eight — that is, tlie thickness of the 
cylindrical object is measured at eight equidistant points along 
the periphery. 

In order to arrive at the control chart of averages and 
ranges, we consider the eight determinations taken on one article 
as being so many samples, and say consequently that we are 
working with samples of 8, whose averages we plot on the chart 
for averages and whose ranges we plot on the chart for 
ranges. 

The difficulty now is this : the specification range for single 
measurements, *405 in. to *415 in. = -010 in., is in this case 
evidently independent of the specified required range be- 
tween any co-ordinated 8 determinations, that is between the 
measurements taken on one and the same article, which is 
•004 in. 

The 1/1000 limits, calculated from the former for samples 

of 8, and resulting in i = ±-0018 in., 

Wn 2V8 

bear no direct relation to the allowed amount of non-parallelism 
of -004 in. per article. 

This is shown by the following simple consideration. The 
average range w in samples of 8 which would correspond to 
the distance, of the 1/1000 limits from the specification mean, 
viz., ±1-8 thousandths inch, is evidently l-8-~A'ooi or 
1’8 q- '384 = 4'7 thousandths inch. But we are supposed to 
work to a 4*0 thousandths inch maximum range within samples 
of 8, and if the specifications are correct — that is, in conformity 
with what can be attained in actual work. — ^we should be able' 
to do so. The maximum range clearly has the function of the 
upper ] /lOOO limit on the chart for ranges (see paragraph 
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2.13), and for samples of 8 we can easily obtain the average 
range in such samples by dividing the value of the limit, 
0-004 in., by the factor for samples of 8 (see table, 

Appendix IV {b)), viz. 2-04, which gives -002 in., sensibly. This 
is considerably less than the value for the average range result- 
ing from the first part of the specifications, viz. -0047 in. The 
implication is that the specifications make due allowance for 
the mean to vary to a somewhat greater extent than can be 
accounted for by fluctuations of sampling only. It follows in 
the light of what has been said in the preceding paragraph 2.11 
that we should allow the averages, and thus the 1/1000 limits, 
to approach nearer to the specifications than would correspond 
to a 4-7 thousandths in. range in samples of 8, namely, as near as 
corresponds to a 2thousapdths in. average range in such samples. 

All we have to do is to determine the distance from each 
tolerance at which the 1/1000 limit (and, therefore, the plotted 
averages) could still lie if the product was to be within specifi- 
cations. The extreme averages, if they are still to comply 
with the specifications, should be distant from the tolerance 
by not less than ^ bulk range — | sample average range, or 

Q.nn„ o.nn o 3-09 Wg 3-09 fiig 3-09 X ’002 3-09 X -002 

' ^ V7i d, dsVS 2-85 2-8 x 2-86 

= -00271 in. — -00077 in, = -0014 in. 1-4 thousandths in., 

then, is the distance farther than which a 1/1000 limit, 
obtained from actual work, and set in fi'om tolerances is 
not to be distant from either tolerance if the product is to 
comply with both parts of the specifications. This, as -will 
be seen from the folio-wing diagram (Kg. 2.16), increases 
die allowable range for sample averages by more than 
3 thousandths in., and prevents our having to ask for machine 
adjustment vdthout the product falhng outside specifications. 

But these “ 1/1000 limits ” have here a somewhat different 
meaning from those encountered in paragraph 2.11. 

There they were calculated from the observed average range 
in samples of n, which happened to be smafier than the average 
range for such samples when obtained theoretically from the 
specifications. They represented the ” top lirhits in either 
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direction from the mean for either averages or 1/1000 limits 
obtained from the averages ; anything within was acceptable. 
They may be said to have the function of being the “ limits 
of* limits/’ and remain what they are as long as the average 
range is the same. 

Here, however, they are obtained from a range which 
explicitly is part of the specifications and as such is constant. 
In our case, it was the average range between any eight 
measurements taken on one and the same article, viz. -002 in. 
The actual average range in such samples of eight determina- 
tions may be better or worse. If better, the limits will be 
nearer to the specification tolerances, if worse, they will be 
farther away. The “ 1/1000 limits” which we obtain from 
both parts of the specifications and, as usual, put into the 
margin of the chart as guiding lines, have thus here the function 
of a “ bottom limit ” in either direction from the mean as far 
as observed 1/1000 limits are concerned (not as far as single 
averages are concerned). 

A 1/1000 limit obtained from the actual observations 
according to paragraph 2. 1 1, and set in from the specification 
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tolerances must not approach the specificatioti mean nearer 
than those limits. If it does it is not acceptable, because it 
implies a greater range than that specified. 

The theoretical limits which we obtained here may be sslid 
to have the function of being “ limits to modified limits ” or 
bottom limits to the top limits of 1/1000 limits. 

The following control charts for average and range of 
samples of 8 thickness determinations per ring, the sampling 
incidence being one ring every half-hour, will help to make 
this somewhat involved relationship clearer (Fig. 2.16). 

2.15 Variation “Within” and "Between” Samples (or Sub-groups) 

In drawing conclusions from the range chart we must 
always be careful as to whether the variations in range occur 
within or between the sub-groups or the samples of n specimens. 

Sub-groups of samples may be made, for instance, according 
to batches of components or according to shifts. If we find 
the average range varying significantly from one of these sub- 
groups to the next, we would conclude in the first case that 
there was a difference in the quality of components so far as 
it affects the characteristic for which we measure, and in the 
second that there was a difference in operation, one team of 
workers producing a product of greater uniformity than 
another. 

If, on the other hand, the variations within the batches 
of components, or within the teams of operatives, was ap- 
preciably greater than that between these sub-groups, it 
would indicate that the condition of machine tools was not 
satisfactory, or that parts of the machine had become loose. 

Equally important is the distinction between variation 
" within ” and “ between ” samples of n. In general, if the 
variation “-within” was greater than that “between” it 
pointed to instability of the machine ; if it was the other way 
round, it pointed to instability of the operation. 

This very important matter will be discussed in greater 
detail in Chapter IV. Be it noted here that die possibility of 
distinguishing between those two kinds of variation as afforded 
by the control chart method is due to the working with 
sample averages instead of with individuals. 
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2.16 The Meaning of Working with Averages in Industrial Control 
Inspection 

It is very important for the understanding of the quality 
cc*Qtrol chart to realize how closely is the working with averages, 
instead of with individual observations, bound up with the 
problem of distinguishing between chance and assignable 
causes. 

We have referred to the taking of samples of a certain size 
at specified intervals, temporal or numerical, as the sub- 
grouping of observations. Let us be clear about the meaning 
of the term “ sub-group.” 

Arranging individual articles into one frequency distri- 
bution by recording the number of measurements falling into 
successive class intervals of our scale is called “ grouping the 
observations.” 

We may, however, prefer to take a number of observations, 
say n, or, in other words, a sample of n units, and group these 
samples according to some quantitative characteristic. This 
means that we sub-divide the total into as many secondary 
groups as there are samples of n units. 

The term “ sub-group ” is also used for batches of product 
proceeding firom different sources such as different shifts, 
different machines, different lots of components, etc., and helps 
to differentiate between such collections of samples as are 
intermediate between the total and the proper samples of 
n units. 

The question now arises : How do we know what sub- 
group to use ? What shall be the sample size and what the 
interval — also called the sampling incidence — ^and why make 
sub-groups at all ? 

Now, the second type of sub-group referred to above gives 
us a hint for answering these questions. 

Within a shift or within the product coming from one 
machine only, or within one lot of components, there should 
be no variations apart from those due to chance causes which, 
in a certain sense, are unavoidable variations. But it is quite, 
conceivable that the operation may differ from shift to shift 
because one team of operatives may work better than another ; 
that a machine might have worn tools and work Jess reliably 
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than others ; and that lots of components may dilTer in quality. 
The sub'division into batches of samples coming from con- 
ceivably different sources, and the comparison of the average 
cjuality of these batches, will now make it possible to differentiate 
between them with regard to the quality of the product, and to 
locate trouble at the source as a preliminary to improvement. 

This is also the principle of the further finer sub-grouping 
by talcing samples of n and working with their averages. 

These sub-groups (in samples of n units) should be chosen 
so as to break up the total into groups of .such size and intervals 
that within them variations may, for engineering reasons, be 
considered to be due to chance causes only, but between which 
there may be differences due to assignable causes whose 
presence is considered possible. Working with sub-groups and 
samples of n instead of with the total and the individual 
observations, then, enables us to distinguish between chairce 
and assignable causes. 

' The full power of the method of working with averages 
instead of with individuals will become apparent in Chapter IV. 

The fundamental principle of the control chart technique 
is thus that we take small samples, fi'om 4 to 10, in specified 
regular intervals, and ascertain whether, in spite of the non- 
randomness of sampling, the total of samples complies with 
the requirements of randomness (judging by the behaviour 
of sample averages and measures of spread). If so, we are 
satisfied that we are dealing with a chance cause system, or, 
as we say, that the process is under control. 

With this purpose in view, we make the sampling incidence 
(intervals between samples) such as would most likely show 
up any cause of disturbance or non-randomness. These 
intervals we specify according to what is known about the 
variability of the process, that is, according to the time, or 
number of manufactured articles, it would take the suspected 
disturbing cause to make itself felt upon the characteristic in 
question. ' ' 

When working to specifications we compare the position 
and distance of the observed control lines with those for the 
100 per cent perfect distribution. That is, we compare the 
Wo means and the two standard deviations, But only under 
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one condition are we justified in drawing the conclusion that 
if mean and standard deviation are sensibly the same, the 
product will comply with the specification. 

» This condition is that the same statistical law governs both 
the theoretical and the ob.served distribution. It is fulfilled 
for our case if the process is under control, because this implies 
that it is subject to the Normal Law, and thus follows the same 
type of distribution which wc assume to prevail in the specifica- 
tion bulk. Lack of control is indicated by points falling outside 
the control limits (in greater than the assigned proportion) 
and this is taken as a signal for corrective action. 



Chapter III 


THE ROD OF MOSES— PROBABILITY GRAPHS 

3.1 Mathematical Tools Simplify the Application of Statistical 

Methods to Industrial Problems 

The method explained up to now follows the process closely. 
It can be described as being an almost synchronized method 
for recording the quality of the product. 

It iSj however, sometimes desirable and sufficient to have 
an historical picture of the quality, drat is, a sort of summary 
of the recordings made over a certain period of time. Such 
a method is, for instance, suitable for sampling inspection of 
components delivered to the factory for the purpose of assembly, 
and for the examination of the finished product (especially if 
the characteristic for which we sample is measurable or weigh- 
able). 

The principle underlying both Quality Control in the 
strict sense of the term and the method to be explained now, 
is the same. We want to obtain a reliable estimate of the bulk 
quality from samples taken from the bulk, and thus be able 
to compare the quality actually produced with the specifications. 

In quality control we achieve this aim by deriving from 
the actual samples control limits which enable us to estimate 
the bulk quality, and to compare it with the specifications. 

The method about to be explained goes still further as 
regards compactness of information. The whole information 
about bulk quality in relation to the specifications is now 
contained in a single line, whose position and direction allow 
us to draw certain inferences. It thus differs from the former 
method essentially in the degree of compactness of information, 
or of simplification. 

It is interesting to note that, in applying |hese methods of 
control, the examiner or inspector is handling rather involved 
problems of probability. To do this without adequate know- 
ledge of the theory of probability and statistical theory would 
be. impossible had not mathematical tools been provided for 

. ia 
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handling the matter in a quick and efficient manner. These 
mathematical tools are so simple that the matter has become 
accessible to practical people engaged in the manufacturing 
process, and can be handled without much previous knowledge 
of the theory of probability. 

We have already encountered one of those tools, which 
consists in differentiating between a pure chance distribution 
and a distribution disturbed by assignable causes, simply by 
counting the number of observations inside and outside the 
control lines. If the proportions of these classes of observations 
were those demanded by the theory of the normal curve, then 
we concluded that we were dealing with a chance distribution. 
Otherwise, we concluded that there were assignable causes 
interfering with the system of chance causes. 

Now we are coming to a mathematical tool which simplifies 
the matter still more, but wliich has the same theoretical basis 
— that is, the properties of the normal curve. 

3.2 The Principle of the Probability Graph* 

Let us assume that a quality control chart has been in 
operation for some time and that individual observations only 
have been plotted. Looking at the following diagram we see at 
once that the number of points belonging to certain intervals 
of the vertical scale, which represents our variable character- 
istic, differs from one interval to the other, although sym- 
metrically placed intervals show approximately the same 
density of points. 



Fio. 3.1 

A typical Control Chan for Individuals if the Process is under Control 


We could now group the points in such a way as to obtain 
a frequency curve. All we have to do is to push the points 

* The use of a specially constructed probability grid is due to A. Hazen 
^Transactions A.S.C.E., Tl, 1639-1640 (1914)). As a method of industrial control 
it Was developed in the Ministry of Supply during World War II. 
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towards the left-hand scale while preserving the vertical 
distance of a point from the origin of that scale, rotate the 
chart back from where we started — ^paragraph 2.3 — that is, 
through an angle of 90°, and erect over the intervals rectangles 
whose area is proportional to the number of points belonging 
to that interval. We can now what is called “ sum ” the area 
of the curve from one end to the other simply by putting one 
rectangle on top of the other, adding thus the frequency of all 
the intervals successively. Joining the tops of the rectangles 
thus obtained, we arrive at an S-shaped curve called the 
ogive or cumulative curve. 

What follows now is reminiscent of the Biblical story 
according to which the snake writhing on the ground was 
transformed into a straight rod when the fearless hand of 
Moses lifted it off the ground. It is possible to transform the 
wriggly shape of our curve into a straight line by choosing 
suitable interwals on the vertical scale, such as will stretch that 
curve into a straight line. All one need do for that purpose 
is to space the intervals, not equally as in Fig. 3.1, but 
according to the probability integral or the Error Function. 
Tables for that function are available which tell us the per- 
centage of observations which is expected to lie up to a certain 
distance either on both sides of the mean or from the left-hand 
start of the curve. Graph paper constructed on these lines is 
commercially obtainable, firom which we can read what 
percentage of observations is expected to lie within a given 
distance from the left-hand origin of the cumulative curve. 

If we now wanted to get an estimate of the quality of the 
bulk of a certain article we would pick a random sample of 
sufficient size, at least 100, then plot the cumulative curve, 
according to the rules given in the following paragraph, on 
Probability Graph paper, and proceed to translate the geo- 
metrical picture so obtained into physical terms. 

As seen from the following diagram, there is a point marked 
60 on the right-hand scale, meaning 60 per cent, of the distribu- 
tion. Projecting that point on to thb line representing our 
sample, and then projecting the crossing-point on to the 
horizontal scale, we read on that scale the mean of the dis- 
tribution. Furthermore, projecting the point marked 16 or 
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84 on the right-hand scale on to our line, and again projecting 
the crossing-point vertically downwards, we read on our scale 
the Standai'd Deviation, as tlie distance of that point from 
the mean. 

Why this is so will become clear when the construction of 
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the diagram has been explained in detail (3.3 and 3.7). All 
that should be noted now is that it enables us to obtain, merely 
graphically and without any calculation, the two important 
parameters of the normal curve, the mean and standard 
deviation. 

3.3 How to Plot a Distribution of Measurements on Probability 
Graph Paper 

The method of constructing a probability graph is as 
follows. Let us assume we have a number of determinations 
taken on a certain characteristic such as the weight of a com- 
ponent of a mixture. These varying measurements of our 
characteristic are then, as a rule, grouped — that is, we divide 
the range between the smallest and tlie largest measurements 
into at least 10 ecpial parts and so obtain 10 or more intervals 
for our variable (13 in our example). 

We now ascertain how many of the 327 determinations fall 
into each of these 13 intervals, and thus obtain what is called 
the frequency distidbution of our sample. 

Columns 1 and 2 in Table 3.1a (page 51) represent the 
range of values for a given characteristic and the frequency 
distribution for a sample 0 / 327. Column 3 gives what is 
called the cumulative curve, which is obtained by summing 
the series of column 2 from one end to the other, start- 
ing at the lowest value of the variable. 

Comparing the position of the series in columns 2 and 
3 we see the following difference. The class frequencies of 
column 2 are placed opposite the centre of each interval, 
for instance 69 being the centre of the interval 68 to 70, 
whereas the cumulative values of column 3 are placed opposite 
the end of each class interval. The reason is this. The group- 
ing of a number of measurements, 327 in our case, into a con- 
siderably smaller number ofintervals, introduces an element of 
uncertainty. As we are only interested in whether a weighing 
falls into one group or another, the weighing in one interval 
can range from the lower limit to the upper limit of that interval. 

In order to be able to treat these observations as having a 
definite value, which is essential for calculating the parameters 
of the distribution, they are by convention placed against the 
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centre of the interval, and this is the value which is given to 
them. 

If it now comes to summing up the area under the curve 
represented in column 2 from one end to the other, we must, 
in order to arrive finally at the end of the last interval, proceed 
in steps of whole intervals, starting at the end of the first 
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interval and then going on to the end of the second, end of 
third, and so on. Our cumulative values are therefore to be 
placed not against the central value of the interval, but against 
the end of each interval. This has the effect of shifting series 
column 3 upwards by half an interval. 

Column 4 gives the reduction of the series in column 3 to 
100 in our case, by dividing each value by 3'27. The figures 
in column 4 then represent percentages, and it is these per- 
centages which are plotted on tire probability graph. 

3.4 The Effect of Grouping 

Let us dwell a little while on the effect of “ grouping.” 
It is evident that the size of the interval will have an effect 
upon the shape of the distribution and thereby upon its 
parameters. The question now arises : What governs the size 
of the interval ; and secondly, how does grouping as such affect 
the distribution ? 

1. Whenever we are taking measurements of a certain 
dimension, the intervals of the scale according to which we 
want to group the measurements are dependent on the pre- 
cision of the measuring instrument. If, say, the instrument 
measures to a precision of ±'6 thousandths of an inch, then 
there would be no use in having a scale with class intervals of 
less than -5 thousandths, although the actual precise dimensions 
do not jump by -5 thousandths in. but vary continuously. 

All one can do in such a case is to take the attainable 
precision of the instiument as representing the minimum value 
of the class interval of the scale. What this amounts to is 
that we take the centre of a class interval of the attainable 
precision to represent the value of the variable. 

What is necessity in this case becomes a convention when 
we are dealing with variables which we find it advisable to 
group into much larger intervals than would correspond to the 
precision of the measuring instrument, because what matters 
economically in such cases is only a certain interval of the 
variable. We then treat these intervals as if they were the 
intervals attainable by the precision of measuring and regard 
the centre of the interval as representing the value of the 
observations falling into that interval. 
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2. Within each interval there will be a miniature frequency 
distribution, with a mean value somewhere near the centre of 
the intei’val and a total range equal to the interval itself. 

Assuming each such distribution to be normal, we could 
take the centre of the interval as representing the mean. 
This is the theoretical basis for regarding the interval centre 
as representing the value of all observations falling into that 
interval. 

But it is clear from the shape of the total normal distribution 
that the distribution within the intervals will differ from 
normal ; the values near the inner end of the interval will 
always be more frequent than those near the outer end. The 
actual mean of each interval will therefore be displaced 
towards the inner end of the intervals. This necessitates a 
correction of the standard deviation obtained under the 
assumption of normality of the interval distributions, by 

subtracting from the squared standard deviation where c 

is the number of units per interval. This correction is called, 
after its otiginator, Sheppard’s correction. 

3*5 The Distinction Between Bulk (Population) and Sample Quality (I) 
— The Chi-Square Test 

We have now to consider whether the straight line prob- 
ability graph can be taken to represent adequately the bulk 
from which the sample camq. The problem has two stages : 

1. Taking the mean and standard deviation of our sample 
to be those of the bulk, can the line be regarded as a reasonably 
good representation of that bulk distribution ? ; and 

2. To what extent arc we justified in identifying the mean 
and standard deviation of our sample with those of the bulk 
of the product in which we arc ultimately interested ? 

These questions will be dealt with in this and the following 
paragraphs. 

The implication of the Straight Line Probability Graph is 
that the line represents a bulk with the same mean and standard 
deviation as we have found to characterize our sample. There 
vviU be, as a rule, deviations between the class fi,’equeiicies of our 
( 612 ) 5 
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sample and those of the theoretical bulk distribution, and the 
question is now whether the deviations resulting in a particular 
case can be regarded as being due to fluctuations of chance 
only, or whether they are too great to comply with our assump- 
tion that the straight line graph really respresents the bulk from 
which our sample came. 

Considering that a normal distribution is entirely deter- 
mined by the two parameters, mean (M) and standard devi- 
ation (cr), we construct (instead of drawing it free-hand) tlte 
straight-line probability graph representing a normal distri- 
bution with the same M and cr as our sample, as a preliminary 
to carrying out the test. 

All we have to do for that purpose is to standardize the 
class intervals measured from the distribution mean, that is, 
to divide the deviations of our variable from that mean by a, 
read from the table or graph for the cumulative normal curve 

(see Appendix I) the percentages corresponding to the values ~ 

and plot these percentages on ■probability -paper as the ordinates 
of the corresponding values of at, (see also paragraph 3.7). 

As an illustration let us use the data of Table 3.1, whose 
second half (3.16) shows the theoretical distribution, Its 
cumulative percentage frequencies are nothing else but tlie 
percentages which we have read off the Probability Integral 
Table or Graph. They are given in column 4, whereas 
column 1 contains the interval ends instead of the 
interval centres of our original distribution (Table 3,1c), 
column 2 the deviations of these upper interval limits from the 
mean, and column 3 the standardized deviations. The fre- 
quencies of column 4 are now plotted on probability graph 
paper in exactly the same way as the cumulative percentage 
frequencies of the actual sample but without shifting the series 
by half an interval, as they represent already the cumulative 
frequencies right up to the respective interval ends. 

From the cumulative percentage series we obtain the theoreti- 
cal distribution (column 6) by forming the differences 

of successive terms in the former. This series is now compared 
with the observed distribution reduced to a sample size of 
100 (column S, Table 3.1a). 
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The next step is to determine the diffciences between the 
observed and the theoretical frequcnciesj term by term, 
squaring these differences, dividing by the corresponding 
theoretical frequency, and summing. (For the purpose of this 
test, class intervals should contain not less than five observa- 
tions. The last four intervals weie, therefore, contracted into 
one, and so were the first two, reducing the number of intervals 
to nine.) 

The result is called or Chi square. This function, like 
so many others used in statistical work, has been tabulated and 
charted, the horizontal axis of the chart representing the various 
probability levels and the vertical what is called “ Degrees of 
Freedom.” We arc familiar with the concept of “ Probability 
Levels,” but the expression “ Degree of Freedom ” needs some 
explanation. If a frequency distribution of altogether N observa- 
tions is grouped into n intervals, then it is clear that we can fill in 
only n — 1 interval frequencies arbitrarily ; the frequency of the 
ftth interval is uniquely determined as the difference between 
the sum of the already allocated frequencies and N. If now the 
“ filling in ” is left to chance, we say that the freedom from 
restrictions is measured by the number of intervals whose 
frequencies are not dictated by anything. We thus speak of 
the number n — 1 in this sense as so many Degrees of Freedom, 
But the play of chance is also restricted by using certain quanti- 
ties characterizing the observed distribution for constructing 
the theoretical series, in our case by using M and a. For each 
of these constraints we subtract unity, and arrive thus in our 
case at the number 9 — 3 = 6 with which to enter the table 
of X ^ oh the vertical scale. In the body of the table wc now look 
for the number nearest to our obseiYcd x® for 6 Degrees of 
Freedom, and having found it, read on the horizontal scale 
the probability with which a x“ like ours, and therefore our 
sample, could have arisen in the bulk represen ted by the straight- 
line graph. 

A probability greater than OTO and smaller than 0-90, or 
something between 10 per cent, and 90 per cent., is required to 
support our assumption. In our case, x® — 4-31, w = 6, and 
we read P as somewhat greater than 0'60, which means that 
such deviations as we have encountered between our sample 
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frequencies and the corresponding theoretical ones could have 
arisen about 60 times in 100. Wc have thus no reason to 
abandon our assumption, and can regard the theoretical straight 
line as a satisfactory representation of a bulk showing the same 
M and n as our sample. 

If a x^-table is not available — it was not included in our 
Appendices in order not to swell the number of Appendices 
inordinately — we can make use of the relation between the 
X ^-function and the Poisson Summation series (see Chapter VI) 
and read P from the chart, Appendi.x Vb. We enter the chart 

, y ^ 

With — as the mean value (on the horizontal scale) and with 
2 

Tl 

- as c (the ordinal number of the curves which run across the 
2 

chart), and read P on the left-hand scale.* In our example, 
we enter the chart with m = 2.15 and t = 3, counting 0 as the 
first term in the partial summation series, and find P somewhat 
greater than -60, which is in good conformity with our first 
value of P. 

It should, however, be noted that for practical purposes 
we seldom go to the trouble of constructing the theoretical 
graph for our assembly of points and testing by the x^-niethod. 
Free-hand fitting of a straight line to our distribution is quite 
admissible, and will in the majority of cases serve our purpose. 

3.6 The Distinction Between Bulk (Population) and Sample 
Quality (II) 

We now come to the second part of the problem. It Was 
stated above that what we desired to obtain by the probability 
graph method was, in principle, the same as what we get by 
the Quality Control method, viz. an estimate of the bulk from 
the quality of the samples. As yet, however, our probability 
graph represents only the bulk as judged firom one sample, 
that is, a bulk having the same M and cr as that sample, although 
this was a much bigger sample than we had formerly taken in 
Quality Control work. What is the relation of that picture of 
the bulk to the “ true ” bulk in which we are finally interested ? 


* This obviously applies only for even n 
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Let US imagine a bulk of articles containing a very great 
number, say several millions, of individual articles, then it is 
evident that if we pick a number of samples of 100 from that 
bulk we shall get samples whose means and standard deviatlbns 
differ slightly from sample to sample, and also from the bulk 
M and o-. It follows that, not only have the individual articles 
a frequency distribution, but the sample means and sample 
standard deviations as well, and consequently that we shall 
get a slightly different estimate of the bulk quality from each 
of these samples. We know already from paragraph 2.7 that 
the distribution of sample means is in its form like that of the 

individuals, and that its standard deviation = Zs . For the 

Vn 

distribution of the standard deviation the formula is more 
complicated. Provided the sample is great, we can take it as 

being given by ug.-o. = . 

V 2 ra 

If n is sufficiently great it is evident that, although the line 
representing our sample cannot be assumed to be exactly that 
representing the bulk, yet it will not be very far from it. Utiliz- 
ing the information of the distribution of means and standard 
deviations embodied in the above formulae, we can fence off, 
so to speak, on either side of our cumulative line, an area within 
which the line representing the bulk may be safely expected to 
lie, and conversely, if the true bulk line was given, the area in 
which the lines derived from samples of a certain size might be 
expected to he. This is done in. the following diagram (3.4). 
We draw lines Lj on each side, parallel to our sample dis- 
tribution line L, with means M T , representing samples of n 

vn 

with the same standard deviation as our original sample, but 
whose means are the extreme values still compatible with the 
assumption that they are coming from the same bulk as the 
original sample. 

We then measure on either side of M on the horizontal 

3q. 

scale a ± , erect vertical lines at these points, find the 

crossing-points with the horizontal lines drawn at the 16'6 and 




M cr=io (N=ioo) 
Fio. 3.4 


Probability Graph of Sampling Distribution for Mean and a 

83’ 5 per cent, levels and connect the points so obtained with the 
centre of L (La, L 4 ). These Hnes represent samples of n, having 
the same mean as onr original sample, but whose standard 
deviations are the extreme values still compatible with the 
assumption that they are coming from the same bulk as the 
original sample.* 

* The construction of such lines is due to a suggestion by Mr. A. Blackwell 
Ministry of Supply * 
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We could now draw a freehand curve fitting both the lines 
for extreme averages (Lj, Lg) and those for extreme standard 
deviations (Lg, L 4 ) on each side of L. The area so obtained will 
contain the “ true ” distribution. 

In practice, however, we do not usually take so much 
trouble in estimating the position of the line representing the 
bulk. If the sample size n is great, areas on either side of the 
observed line become negligibly small, so that the bulk line 
derived from our sample can be taken to approximate the true 
line sufficiently well for the purpose in hand, that is, for estimat- 
ing bulk quality. 

Even so, that is, taken by itself, it is more informative than 
the discrete points would be. 

3.7 Probability Graph and Specifications — Examples of Straight- 
Line Probability Graphs 

The most important information we can derive from the 
probability graph, apart from that about mean and standard 
deviation, is that we are in a position to estimate the percentage 
of the bulk within and without the specifications. All we have 
to do for that purpose is to erect perpendicular lines at the 
points on the horizontal scale coinciding with the specifications, 
and to project the crossing-points with our line to the right-hand 
scale. There we read the percentages of the product within 
and without the specifications. The following contains an 
illustration. 

In the diagram Fig. 3.6 it is assumed that an article, say, 
brass discs, with a nominal thickness of TOO in. is being pro- 
duced, and that either the precision of the measuring instrument 
or the engineering requirements make it advisable to group the 
measurements in intervals of -5 thousandth ('0005) of an inch. 
Accordingly, the horizontal scale at the bottom of the graph is 
in '5 thousandth of an. inch. We now plot the cumulative per- 
centage series by putting a dot on the graph wherever the per- 
pendicular erected at the value of a class interval limit crosses 
the horizontal drawn on the corresponding percentage. 

We thus get a distribution of points which will in a great 
many cases be satisfactorily fitted by a straight line. Let us 
assume that in a given case such fitting by a straight line is 




Fig. 3.6 

Typical Straight-Line Probability Graphs 

A Line of controlled dimension : specification line (0 per cent, outside specifica- 
tion). 

B Levd of dimension too high by 2 intervals of measurement. Spread as before 
(5 per cent, outside specification). 

G Level of dimension as demanded by specifieafion. Spread has increased 
(46 per cent, outside specification). 

D Level of dimension too high. Spread has increased (48 per cent, outside 
specification). 

81 
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possible, and let us further assume that the specifications for 
the product under consideration are d: 2 thousandths of an inch. 
At these specification, values we erect perpendicular lines. 
Furthermore, we draw horizontal lines at the points 1 per cesnt. 
and 99-9 per cent., because in practical work we hardly ever 
get a perfect curve asymptotically approaching the x-axis. 
Consequently, the upper and lower horizontal strips of the 
graph paper are not suitable for practical purposes. 

{a) If our product was just as specification demanded we 
Would obtain line A. We sec that no part of this line, and 
therefore nothing of the product, lies outside specifications. 

Considering that we are dealing with a symmetrical dis- 
tribution, it follows that the peak should be at the centre, that 
is at 50 per cent, of the number of observations. We therefore 
find the mean of the characteristic in question by projecting the 
crossing-point of the horizontal at 60 per cent, with the Line 
of Distribution on to the bottom scale. We see that it is just 
the level required by specifications. Furthermore, considering 
that the area of the curve within the standard deviation on both 
sides of the mean will be to the area without as 2 ; 1, we are 'now 
in a position to find the standard deviation graphically. 

The ratio 2 : 1 corresponds to 67 per cent. : 33 per cent, 
approximately. Sixty-seven per cent, of the observations should 
be inside . d: once the standard deviation from the mean, and 
M ± 33-6 per cent, will thus give a clue for finding the standard 
deviation. As M is at 60 per cent., M d: 33'6 corresponds to 
83-5 per cent, and 16-5 per cent., and we obtain the standard 
deviation by finding the crossing-point of 16-5 or 83-6 per cent, 
with our distribution line, projecting it on to the bottom scale 
of the variable and reading there the distance from M to the 
point so determined.* This gives nearly -7 thousandth of an inch. 
Multiplying this amount by 6 should (see paragraph 2.6) give 
the total range. The range so obtained approximates that laid 
down by specifications since the total range within specifications 
is thousandths of an inch, and 6 X ’7 == 4-2. 

(b) Let us assume that we had obtained line B repre- 


’ Conversely, having calculated M and a, we can ^jiunediately construct the 
straight line with the given M and a 
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senting our cumulative curve. We sec that part of the product 
will be outside specification. The percentage of the product 
outside specification is given by projecting the crossing-point of 
liim B with the upper specification line towards the right. 
We read 95 per cent, approximately “ within,” 6 per cent, 
above specification, and notliing below specification. This 
indicates that the level of the product has changed. We can 
make certain about this by projecting the crossing-point of our 
line with the horizontal at 50 per cent, downwards, and find 
that the mean has changed by 1/1000 in. The average dimen- 
sions of our product will therefore be -lOl instead of TOO. 

Constructing the standard deviation as in the former case, 
we find that it has remained unchanged. 

Looking at the graph we sec at a glance how to infer a 
change in the level of production, but stability as far as range 
of variation goes. In such a case the line will be parallel to the 
original one representing a product completely within specifica- 
tion, but shifted towards either right or left. 

(c) Let us assume that out cumulative curve is adequately 
represented by line C. We see at a glance again that part of 
the product will be outside specifications. Applying the same 
method as above, we read that about 23 per cent, will be above 
and 23 per cent, below specifications. Constructing the mean, 
we find that the mean, and therefore, the level of production, 
is exactly as required. Constructing the standard deviation, 
we find that a considerable change has occurred, as the standard 
deviation is now 2-5 thousandths of an inch as compared with 
■7 thousandth before. The line has not changed its position com- 
pared with line A, but its direction is different, and it is such 
a change in direction — without a change in position — which 
indicates an increase, or generally a change, in tire variability 
of the product, 

{d) If line D represented our sample, we could at a 
glance infer both change of mean and change of standard 
deviation, because line D is shifted towards the right, and 
is no longer parallel to line A. This is confirmed by con- 
structing the mean and standard deviation for this line. We 
find the mean at 1 instead of at 0, and the standard deviation 
is 2-6 thousandths instead of *7 thousandth. 
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3.8 The Translation of the Geometrical Properties of Probability 

Graphs into Physical Terms 

This short discussion of some of the characteristic differences 
in probability graphs gives an indication of how to translate 
the geometrical picture into physical terms. Having a line of 
comparison such as line A for a sample that meets specifica- 
tions fully, and plotting lines for the samples actually measured, 
we can, from a change in the geometrical picture, arrive at an 
approximate idea oftlie cause of disturbance (due to assignable 
causes in the economic sense of the word). A change in 
position without a change in direction would indicate that the 
level of the product had changed but not its variability, and 
therefore that something had gone wrong with the setting of 
the machine. A change in direction without a change in 
position would indicate that although the level of production 
was correct, the variability of the machine was too great, and 
that matters should be investigated with a view to finding 
out whether machine tools have become worn, parts of the 
machine loose, or whether there was too much variability in 
the operation itself. 

If a sample was represented by a line which appeared 
shifted compared with the standard line, and also had a 
different direction, we would have an indication that both the 
setting and the variability of the machine wei'e at fault and 
required investigation. 

3.9 Typical Non-straight Line Probability Graphs 

What has been said about probability graphs so fax may 
have given the impression that the points of the cumulative 
curve will always plot to a straight line on probability paper. 
But this is by no means the case. The question now arises how 
to interpret the various typical shapes of graphs. The following 
paragraphs are a short guide to the correct interpretation of 
probability graphs. 

They should enable the reader to differentiate between the 
various shapes of graphs, and to draw conclusions from them 
about tire quality of the product yielding such graphs. It 
should, however, be kept in mind that here, as in every branch 
of practical work, statistics are not meant to replace but only 
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to supplement common sense, and we must not suppose that 
a graphical method like the probability graph method is capable 
of performing wonders. 

As has been stressed throughout the “ Guide,” the method 
is not primarily meant to provide definite information about 
the manufacturing process of the articles under inspection, 
though valuable liints as to that process are certainly derivable 
from it. What we are primarily interested in when applying 
the method of probability graphs of the Normal Law type is 
the percentage of the product within the specified limits laid 
down for the characteristic in question. 

(1) Form the cumulative curve and plot on Arithmetic Pro- 
bability Paper (A.P.P.) according to the instructions given 
in 3.3. Do not omit to shift the cumulative series upwards 
by one-half of an interval. 



Fin. 3.0a Fig. 3.0A 

A.P. Graph Ordinary Graph 


(2) If the points plot fairly well to a straight line, read the 
percentage of the product within specification from the 
percentage scale on the graph. The decision as to what 
is a “ fairly good ” fit must be left to the statistical tact 
of the investigator. Such tact will be acquired by ex- 
perience.* The type of curve that will plot to a straight 
line approximately on A.P. paper is the well-known bell- 
shaped normal curve. 

Be it observed, however, that the conformity of a distri- 
bution with the requirements of the normal curve does not 
say anything about the distribution being a “ simple ” one. 
By tills is meant that it does not imply that the manu- 
facturing process was controlled in all its parts around 
one and the same mean, and with one and the same degree 
of dispersion. The normal distributions which we meet 
* As an alternative, the x’-test may be applied 
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with in industrial work will be more often the result of a 
great number of distributions around different means. 

(3) A diagram of this kind results from a distribution cucve 
with more than one mode or peak. 

Observe that the inflection (double bend) is in its first 
part clockwise and in its second counter-clockwise. Such 
a curve is often the result of the mixing of a number of 
distributions around different means. What interests us 



Fio. 3.6c Fm. 3.6i^ 

A,P. Graph Ordinary Graph 


chiefly in practice is not the way in which the curve was 
produced, but rather how to judge the percentage of 
product within specifications, though the indications as to 
the nature of the distribution should by no means be 
neglected. If the inflection shown on the probability 
graph is too pronounced, then the percentage scale of the 
A.P. paper should not be used. 

We can, however, by other means arrive at an estimate 
of the frequency within certain limits. The method is as 
follows : 

Calculate the standard deviation and divide the distance 
between tolerance and specification mean by the standard 
deviation. This gives a quantity “ t” We then have 

P ^ 1 — i . This formula, known as Tschebychpff 
z 

Inequality, means that the percentage of the product lying 

within specifications will be at least equal to 1 — -i . Gon- 

sequently, 1 ~ P or 1/t® will give the maximum percentage 
of pieces outside specifications, as judged from the sample 
in hand, for a bulk having the same mean and standard 
deviation as our sample. 
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What this amounts to is that whatever the form of the 
distribution, the proportion of the product deviating from 
the mean by more than 
” 2cr is less than 1 /4 

3c- is less than 1/9 
4a is less than 1/16. 

A determination of this kind will in the majority of cases 
of examination of components be found to give a sufficiently 
close estimate of the quality, because the purpose of the 
examination is not to determine the exact percentage of 
defects in the bulk, but to spot any unusually bad lot of 
components. ss 

(4) A curve of this land results from a symmetrical onc-pcakcd 
distribution curve, whose central ordinate (height) is 
greater than it should be according to the Error Function. 
Observe the characteristic difference between the inflection 
in this case and that of curve type (3).* Here the first part 
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A.P. Graph Ordinary Graph 


of the double bend is counter-clockwise and the second 
clockwise. Such a curve is often the result of a mixture of 
a number of normal distributions around the same mean 
but with varying standard deviations. But just as with the 
type of curve given under (3), it need not be. It can be 
conceived as representing a distribution function, according 
to which frequencies of the central values are excessive. 

In such a case we again camiot employ the percentage scale 
of our A.P. paper for estimating the part of the product 
within and without specified limits. But a formula, similar 
to that given under (3), allows an even closer guess at the 
percentage of the population lying within such limits. 
Calculate the standard deviation and express the distance 

* To this fact attention, has bceftdrawnb-y Mr. E, H. Scaly, formerly Ministry 
of Supply 
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of tolerance minus specification mean as a multiple of the 
standard deviation. This gives a quantity “ t.” Then we 

have P > 1 i — . This formula is known as the Gamp- 

2-25i^ 


Meidell Inequality, and the meaning is analogous to that 
explained under (3). It gives the minimum percentage of 
the product that can be expected to lie within specifications, 


and, therefore 


represents the maximum percentage 

2 * 26 ^^ 


outside specifications, for a bulk having the same mean 
and standard deviation as our sample. 

(6) A curve of this kind results from a symmetrical one-peaked 
distribution in which the fi’equencies towards the extreme 
end of the distribution are greater than they should be 
according to the error function. The type of curve leading 
to such a graph is shown below. The characteristic of the 
distribution is that it is deficient in height for values 
between central and extreme values compared with a 
normal distribution, and such a deficiency may be caused 
by any tendency in the process towards depleting the 
number of values between medium and extreme values 
beyond the percentage provided for such values by the 
error function. If the S-shape on the A.P. graph is too 
pronounced we ought not to use the percentage scale of 
the A.P. paper for estimating the quality of the lot, but 
apply the same metliod and formula as given under (4). 
The properties of the curves of types 3 and 6 are often 



Fig. 3.6g Fig. 3.0/! 

A.P. Graph Ordinary Graph 


found to combine and then constitute what is known as 
“ leptokurtic ” (narrow humped) curves. In fact, the two 
graphs On probability paper are clearly seen to be only 
the extreme forms of one and the same type of inflection. 
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Fig. 3.Gi Fig. S.fij 

A.P. Graph Orciinaiy Graph 


(G) A curve of this type results from a one-peaked symmetrical 
distribution, in which the values about half-way between 
mean and extreme ends are relatively too frequent, taking 
the error function as a standard comparison. It may re- 
sult from a mixture of normal distributions, but it may also 
be a homogeneous distribution. If the inverted S-shape 
of the graph is too pronounced, the same method as 
given under (4-) is to be used for estimating the percentage 
of the product within specification tolerances. 

The properties of curve type 4 and 0 are often found to 
combine and then constitute what is known as the “ platy- 
kurtic ” (broad humped) curves. In fact, the two graphs 
on probability paper are clearly seen to be only the extreme 
forms of one and the same type of inflection. 


3.10 Logarithmic Probability Graphs 



Fig. 3.7a 
A.P. Graph 



If the graph represents a simple curve, either concave or 
convex to the horizontal, Logarithmic Probability Paper 
(L.P.P.) should be applied for judging the percentage of 
the product within specified limits. In order to under- 
stand what happens in this case, and what the logarithmic 
transformation means, let us consider the following illus- 
tration. 

( 612 ) 


6 
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A sample of 200 small brass shells was measured with a 
micrometer with regard to their external diameter whose 
specifications are -316 in.— 317 in, A frequency distribution 
was drawn upj and the cumulative curve plotted on probability 
graph paper (Table 3.2, Fig. 3.8). 


Table 3.2 

Spec. Mean = -3165 Spec. Limits ±-0005 


Dimension 

No. of 
samples 

Cuin. 

curve 

Gum. 

0/ 

/o 

Dimension 

No. of 
samples 

Cum. 

curve 

Cum. 

% 

•31575 

1 



■317375 


191 

96’5 

■315875 


1 

■5 

■31750 

6 



■310 

34 



■317625 


197 

98'5 

■310125 


35 

n-5 

■31775 

— 



■31625 

8 



■317875 


197 

98^5 

■310373 


43 

21-5 

■318 

2 



■31660 

116 



■318126 


199 

99^6 

■31662,5 


15!) 

lft-5 

■31825 

— 



■816755 

8 



■318375 


199 

99^6 

■310875 


167 

8S-5 

■31850 

— 



■317 

24 



■318625 


199 

99^6 

■317125 


191 

d5-5 

■31875 

1 



■31725 




■318875 


200^0 

100 


According to that determination 6 per cent, of the lot are 
assumed to lie beyond the upper limit of -317. 

Looking at the graph, which contains the specification lines 
and the straight line which is supposed to represent the 
distribution, we see that it is drawn in such a way as to leave 
out the upper tail of the distribution. It is clear from the 
graph that if one wants to include all points, a curve (Lg) 
should be drawn. Such a curve, however, would not serve 
our purpose as it would not admit of any inference as to the 
percentage in the bulk inside and outside specification limits. 

The shape of the distribution, in conj unction with what is 
known about the particular process by which these shells are 
produced, points to logarithmic probability graph paper as the 
proper kind of probability graph to use. 

The considerations are briefly these. The shells are drawn 
out by means of a die in continubus production. If that die, 
at the beginning of the manufacture of the lot, was made 
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Fio. 3.8 

Probability Graph where the Plotted Points follow a Curve 

precisely to the correct diameter, it will changp ever so slightly 
from one shell to another until the deviation from the nominal 
value of the diameter becomes perceptible to the sensitivity of 
the micrometer. 
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If now the die was ever so slightly alTcctcd by the shells 
which it turned out, that is, if the dimension was gradually 
changing, i.e. growing, then the diameters of the shells would 
not any more be entirely independent of one another, au is 
essential for a normal distribution. Each of the observations 
would now be influenced by the foregoing one. The dependence 
between the diameters of successively turned out shells is 
evidently such that, in addition to the unavoidable random 
variations, an increment was added to each measurement in 
proportion to the magnitude of the last observation. 

In such a case the error function appears to be intimately 
linked up with the functional relation for that kind of dependence 
of increments, viz. with the Law of Growth which is of the 
logarithmic type. 

The consecjuence is that the distribution will no longer 
conform with the lequiremcnts for a normal distribution, 
but becomes skew. It can, however, be transformed into a 
normal one if a slight change is introduced in the scale.* 

Remembering that tlie type of function governing such 
dependence of increments as has been described above is a 
logarithmic one, it is easily seen that a logarithmic instead of 
an arithmedc scale would reduce the curve to normal. This 
is imperative if one wants to esdmate the percentage of observa- 
tions lying within certain limits by means of probability graph 
paper. 

Such a change in the scale is effected by means of logarith- 
mic probability graph paper, which groups the observations, 
not according to equal distances, but according to equal ratios. 

Consequently, the data obtained from Table 3.2 were 
plotted on logarithmic probability graph paper in the following 
way. Just as is the case of A.P. paper, the cumulative percentage 
frequencies are plotted against the end of the class interval, 
against whose central values they are supposed to stand. The 
values of these interval limits are identified with the successive 
numbers of the logarithmic scale given on the logarithmic graph 

* Distributions of this Itmcl have been studied by Sir F. Gallon and McAlister, 
Proc. Roy. Soe,, voL 29, 1879, pp, 366, 367 ; G. Th. Fechner, Kollektkniasslehre, 
Lelpsic, 1897 j J. G. Kapteyn, Skew Frequency Curves in Biology and Statistics, 
Groningen, 1903. , 
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paper (the long side of the graph rectangle being taken as the 
basis) . 

It should be remembered that the effect of using a logarith- 
mic scale is to open the steeper part of the skewed curve, and 
to push the more gently sloping part together. Consequently, 
if the steeper part happens to be towards the lower values of the 
variable, the |cale should ascend from the bottom of the graph ; 
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if it happens to be tovvards the highei- values of the variable, 
the scale should ascend in the opposite direction. 

As was to be expected if the above assumptions were correct, 
the points plot on logarithmic probability paper satisfactofily 
as a straight line (Fig. 3.9). 

The practical importance of the change in method is that, 
whereas according to the graph on A.P. paper only 6 per cent, 
of the measurements are above the critical value of -SI?, it 
appears now that approximately 10 per cent, are above this 
value. The correctness of this estimate was confirmed from a 
bigger sample. This shows that the use of A.P. paper in cases 
where the distribution is far from normal might lead to 
erroneous conclusions. 

It should be noted that the mere fact of obtaining an 
empirical fit by means of logarithmic probability paper is no 
strict evidence as to what the law governing the distribution 
is like, but there is no need for having strict evidence on this 
point in practical industrial work. We are not concerned here 
with ascertaining laws of distribution, but with estimating the 
percentage of a product lying within certain limits. Whenever, 
therefore, we are in a position to reduce by a legitimate method 
a skew curve to normal, and the cumulative curve to a straight 
line, we should make use of it, and this is what a logarithmic 
probability mesh can do for us. 

It is important to realize that although the different shapes 
of the curve on A.P. and L.P, paper will often allow a guess as 
to the possible cause of the deviation from normality, it is not 
this that chiefly interests us here, because the manufacturing 
process of piece-parts or components is, considered from the 
viewpoint of the further assembling process, a thing of the past, 
and because any opinion about the otherwise unknown manu- 
facturing process can only be of a tentative nature. We are more 
concerned with estimating the quality of the product in order 
to ascertain whether or not it is fit to go into further production. 

The fact we want to ascertain is, tlrerefore, the percentage 
of the product within specified limits for a given characteristic. 

In brief, what matters chiefly is not whether the manu- 
facturing process to which we owe the components was under 
statistical, but whether it was under economic control. 



Chapter IV 


THE SLIDE RULE PRINCIPLE OF PROBABILITY 
—THE ANALYSIS OF VARIANCE 

4.1 The Necessity for a More Powerfijl Method of Analysis 

Up to now we have learned how to ascertain statistically the 
presence or otherwise of certain causes of vaiiation in the 
quality of industrial products (assignable causes), but our 
methods so far gave no clue as to the extent or amount or 
intensity of such causes. In order to obtain numerical estimates 
of the effect of individual causes we must employ other more 
powerful methods. One of these is called the Analysis of 
Variance, because it is possible by means of that method to 
split the combined effect of a number of assignable causes 
plus random variations into the parts which are due to the 
various contributions, and so disentangle the contributing 
factors quantitatively. 

As an easy way of approach to the method of analysis of 
variance I shall choose the control chart method which, as we 
shall see, is in itself a sort of analysis of variance. I believe 
that this way of approach will go far to dispel any apprehension 
as to the lai-ge amount of higher mathematics needed for the 
understanding of the analysis of variance. 

4.2 Quality Control Charts as a Simplifted Analysis of Variance 

Let us assume that an article is produced on a five-spindle 

automatic in three shifts, that samples of 10 are taken at regular 
intervals, and that the article is measured for a certain char- 
acteristic, say, diameter. In order to set up a quality control 
chart we calculate the average range of a sufficient number of 
samples of lOw, multiply with the factor which transforms 
it into the standard deviation of tire population, divide by s/n 
and multiply by 3'09. 

The reason for the two last-mentioned operations is this ; 
as we are plotting averages of 10, we want control lines for these 
averages ; and the ratio in which the standard deviation of the 
population is reduced in the case of averages is l/V”* We thus 

75 
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obtain the standard deviation of the distribution of averages, 
and multiplying tliis quantity by 3-09 gives the position of the 
1/1000 control lines within which practically all the averages 
should lie. A simpler but equivalent way is to multiply 
S,, bv A' 

(a) Let us first assume that the five spindles and the three 
shifts all produce practically the same quality of our article with 
regard to the characteristic in question. This means that all 
the samples of 10 which we obtain in a given period of time 
will be VC17 much alike though not precisely the same. As far 
as they differ, this could be accounted for by mere chance or, 
as it is also called, “ fluctuations of sampling.” In such a case 
the sample means will themselves be distributed more or less 
normally around the true mean with a standard deviation 
amounting to the standard deviation of the population, cr, 
divided by y/ti (standard error). 

As the quality control chart limits are 3'09 times that 
quantity on either side of the mean, it follows that if the samples 
of 10 are only distinguishable from one another by sampling 
fluctuations, practically all the averages should fall within our 
control lines. 

A Quality Control Chart of that Knd therefore represents 
samples from one and the same population. 

— — - 7I000 

__ _ _■ y^Q 

* M 



'/40 

'/lOOO 


PtG. 4.1 

Control Chart showing Process under Control 

(b) Now let us assume that the different spindles are set 
sHghdy differently so that the level or averages of the dimension 
in question will not be the same for all the spindles. In such 
a case, samples of lO coming from different spindles will be 
on the whole around different means. The spindle means 
themselves will again have a distribution, but their standard 
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deviation will now be made up of two parts. First of all there 
will be a part representing the standard deviation of sample 
means of 10 coming from one and the same spindle ; that part 

will be * where Oj is the average standard deviation 

VlO 

in samples of 10 specimens. 

. The second part will not be due any more to sampling 
fluctuations but to the real differences between the sample 
means, and will be called o-^. 

The standard deviation of the averages of spindles will 
therefore be made up of cr„ and It is actually the squares 

of these quantities which arc addidve, Isut for the moment the 
relation given above will do. 

Now it is clear that, as the control limits are calculated from 
the range of the samples of 10, there need not be a change in 
the position of control limits in this case, provided that the 
variation inside the samples of 10 remains more or less stable 
regardless from which spindles the samples are taken. 

But the picture of the Quality Control Chart with the 
average points plotted is now very different from what it was 
before. A number of points -will now fall outside the control 
limits because the variation of averages is now increased by the 
term cr^. It depends on the amount of this factor and thus on 
the amount of difference between the various spindles to what 
extent the control will be disturbed. 


X 


VlOOO 

'Ao 



VlOOO 


Fig. i.% 

Control Chart showing Process out of Control 


* Strictly speaking, a, should be corrected for sample size if it is to serve as an 
estimate of the population standard deviation, but this point is taken care of by 
the analysis of variance method. 
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4.3 Deriving the Quantities which are to be compared According 
to the Analysis of Variance Method 

In order to make the transition from the control chart 
method to the method of analysis of variance — which is nothing 
new in principle, but gives a very practical arrangement of the 
arithmetic — ^it is important to realize that if the standard 
deviation of the averages in a distribution under control was 
multiplied by it should be equal to the standard deviation 
in the population : 

^ P opulation 

^Averages ” 

therefore ■\/n ~ crp . 

If, on the other hand, the production was not under control 
and the picture presented by the control chart was that referred 
to above (i), then multiplying the expression : 


by Vn would give 

ffaV” + 

Cj here stands for the average standard deviation within a 
sample of n, and serves with a certain correction for sample size 
as an estimate of the standard deviation in the population, up. 

It Should be noted that in the analysis of variance we use 
instead of the standard deviations their squares, which are called 
variances (var), and the relations refen-ed to above then 
become : 


= Ur® (4.1) 

var^ = u/ ; .% nvar^ = . . (4.2) 


Now it is just the variance of the distribution of averages 
multiplied by n which in the analysis of variance is, being 
compared with the variance within the samples of n serving 
as an estimate of trp. 

The principle, which from the explanation given above is 
quite evident, is simply this : if n times the variance of the 
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averages approximates the variance within the samples of n, 
then there is no assignable difference between the samples 
— being sensibly zero — and the samples of n can be 
considered to come from one and the same population, which 
is equivalent to saying in quality control work that the process 
can be considered to be under control. The picture represented 
by the chart would be one of all the averages falling within the 
control lines. 

If on the other hand the variance of the sample means times n 
(or nvai\) is significantly different from that within the samples 
of n, the inference is that the samples come from different 
populations or, as we say in quality control work, that the 
process is not under control. The picture represented by the 
quality control chart would be one of a great number of 
averages falling outside the control lines. By significantly 
different populations is simply meant that the differences 
between the sample means are greater than can be accounted 
for by sampling fluctuations. 

In symbols we express the principle of the method of analysis 
of variance as follows : 

Var. of Sample Means . . Kvar^ ^ + 5,® 

Var. wthin Samples .... varj, 5/ 

The question of whether the difference between the variances 
is significant is decided by calculating the ratio of the variances 
and reading from certain tables (see Appendix X) the prob- 
ability that such a ratio could arise by mere chance. If that 
probability is very small, the criterion of smallness being usually 
less than 5 per cent., the difference is considered significant. 
If it is more than 5 per cent, it is considered insignificant. 
In the first case we would for our example conclude that the 
spindles are differently set. In the second we would conclude 
that the setting is, on the whole, on the same level (see 
Chapter VII, and the following paragraph 4.4, for a fuller 
explanation of the concept of significance) . 

To summarize : if everything was completely random or the 
process completely under control, then the variance between 
samples times n should not be significantly different from that 
within samples, and as each represents an independent estimate 
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of the variance of the total, the latter should not differ signifi- 
cantly from cither of the former. 

On the other hand, a significant difference of the variance 
between from that within samples (the latter being caRed 
the “ Residual ”), which again implies a diflcrence between 
either or both of these variances and that of the total, 
denotes a departure from control, or, what amounts to much 
the same thing, the presence of statistically different populations. 

In a similar way, as differences between the spindles might 
have an effect upon the variability of the product, so tire 
diflercnt shifts might have an efiect. Applying the same 
method as before, we should, if the different shifts work all in 
the same w'ay and produce an article of approximately the 
same quality, expect that n limes the squared standard 
deviation of the shift means should be approximately the same 
as that within the samples coming from each shift. If we were 
to find that this is not the case, and that the standard deviation 
between the shift means was significantly greater than that 
within the samples coming from each shift, we would conclude 
that the changing over from one shift to another implied the 
introduction of a disturbing factor. , 

4.4 The Parallelism Between the Two Methods — Statistical Signi- 
ficance 

The exact relation between the Analysis of Variance 
method and that of the Control Chart now becomes clear. 

In order to compare two variances, that of sample means 
and that of the individuals within samples, we must make 
them comparable. This can be done in two ways. We either 
multiply the variance of the sample averages by n, and compare 
it with the variance within samples, or we divide the variance 
within samples by n, after correcting it for sample size, and 
compare it with that of the sample averages. 

According to which way we choose, we obtain the quantities 
(mean sums of squares or “ variances ”) of the analysis of 
variance, or those of the quality control chart (frequencies 
of points within and without the control limits). 

In symbols we can express the principle of the control 
chart technique as follows for sufficiently great n : 

'"Hi* 
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1/6-18 of the total range of plotted points (averages) 



1/6-18 of the distance between 1/1000 control limits — >• 

V n 

If there is no significant difference between the samples, 
that is, if their averages differ by not more than can be accounted 
for by fluctuations of random sampling, will approach zero 
and the two ranges or distances will be sensibly ecjual, or, what 
is the same, all the points will fall within the control limits. 
Otherwise we say that the process is not under control or is 
unstable. 

The principal features of the Analy.sis of Variance and 
the Control Chart technique arc thus the same. In both cases 
the samples of n or the sub-groups arc selected in a non-random 
manner with respect to the total population. Both methods 
now provide tests of very similar mathematical stnteture for 
deciding whether the samples behave as random samples from 
one and the same universe. If they do, the universe is regarded 
as homogeneous or " under control.” 

There still remains the question to be answered, What 
do we mean by saying that variances are, or are not, “ signi- 
ficantly difl'erent ” ? 

Supposing the total of the product tp be homogeneous in 
the sense that variations in the characteristic measured are 
due to nothing but chance causes, then the two values : n var^ 
(sample means) and vara (within samples) should be estimates 
of one and the same quantity, viz. the true variance in the 
population. Because, however, of unavoidable fluctuations of 
sampling, they cannot be expected to be exactly equal. The 
question now arises as to what difference between them can be 
tolerated without compelling us to conclude that they are 
estimates not of one but of two different quantities, and, therefore, 
point to non-homogeneity of the product ? 

We turn again for information to the familiar quality 

* To be pvedse, since we correct the average standard deviation within samples 
for sample size, this term hecomes-2^ ► 
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control chart picture. The meaning of the 1/40 limit is that 
if the scatter of points exceeds these limits in a proportion 
greater than 2/40 — taking both sides together — we have 
sufficient grounds for suspecting the process to be unstable. 
As the proportion 2 in 40 corresponds to 5 per cent., these 
limits represent the values of the variable within which 95 
per cent, of the product should lie. The probability of a 
value beyond is 5 per cent. Such values should thus have a 
probability of occurrence of 5 per cent, or less if the assumption 
of homogeneity was to be upheld with confidence. A prob- 
ability of occurrence of more than 5 per cent, for values beyond 
thc.se limits would make us smspect that assumption. 

Similarly, the probability of a point beyond the 1/1000 
limits is — ^for both sides together — 2 per cent. If the scatter 
of points is such as to overstep that amount, this is again an 
indication that the assumption of homogeneity or “ control ” 
is to be abandoned. But it should be noted that as we hardly 
ever have 1000 points on the chart, any point falling outside 
the 1/1000 limits is to be taken as a definite indication that the 
process has become unstable. 

Thus in the control chart technique we compare two values 
of the measure of dispersion, one obtained from the “within 
sample variation” (limit lines), and the other from the " between 
sample variation” (scatter of points), just as we do in the 
analysis of variance technique. 

The occurrence of points beyond the inner or outer limits 
in a higher than the assigned proportion, implying that there 
is a probability of more than 5 per cent, or -2 per cent, respec- 
tively of such outliers, is then taken to be incompatible with 
the assumption of stability or “ control.” 

We can also express this by saying that if the frequency of 
occurrence of values beyond the 1 /40 limits exceeds the 6 per 
cent, level (or that beyond the 1/1000 limits the -2 per cent, 
level), then the assumption of all the samples having come from 
one and the same universe is no longer justified. 

AH this is, of course, nothing but a different expression 
of the familiar rule of the quality control technique, accord- 
ing to which the process is judged to be under confrol if not 
more than 1 in 40 points falls outside an inner, and not more 
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than 1 in 1000 beyond an outei’ limit. Convenscly, we conclude 
that the process has become unstable due to assignable causes 
if points occur outside these limits in greater than the specified 
proportions. 

Considerations such as these are used wherever the question 
of “ significance ” comes up. 

The two quantities which in the analysis of variance 
technique we call “ Var. between samples ” and “ Var. within 
samples ” should not differ by more than is compatible with 
their being estimates of one and the same quantity, if we are 
to be satisfied that the population is homogeneous-random. 

The fundamental significance level is taken at 5 per cent, 
corresponding to our inner control chart limit. If the difference 
(it is actually the ratio which is considered in the analysis of 
variance technique) is such as could occur at least five times in 
a hundred in samples of a given size from one and the same 
universe, then we are satisfied that they are not “ significantly 
different,” or that there were no other than random causes 
active in producing the variability of our product with regard 
to a certain characteristic. Conversely, if the difference was 
greater than is compatible with the 6 per cent, level, we con- 
clude that these samples could not have come from one and 
the same universe. 

Instead of the graphical aid provided by the control 
chart technique for deciding the question of significance, the 
analysis of variance technique provides us with tables (Fisher’s 
.^-tables or those of the Inverted Beta-Function — see Appendix 
X) from which to read the answer to our question. These 
tables have sections for different probability levels. Those 
most often used arc the 5 per cent., 1 per cent., T per cent, 
sections, of which the first corresponds to the level of the 
inner limits (both sides together) and the last to that of one 
outer Hmit.* 

The lower the probability level to which the difference 
between the two variances corresponds, the more confident 
can we be that the difference is significant. Just as in the 
control chart technique the overstepping of the inner limits 
is taken as a warning, whereas that of the outer limits as a 

* see Appendix X, which give the three sections in one table 
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definite indication that the process has iDecome unstable, so 
in the analysis of variance method the 5 per cent, level repre- 
sents the border-line between significant and insignificant 
differences) w'hereas a difference corresponding to the 1 per 
cent., or better still the -1 per cent., level is a very strong 
indication of non-homogeneity. 

4.5 Illustration for Analysis of Variance of Samples Coming from One 
and the Same Population 

The following experiment may serve as an illustration of 
what has been said in this chapter. 

Blue paper discs were stamped out in continuous production 
to a nominal dimension of their diameter of -2630 to •2(140 in. 
These discs were measured by an experienced examiner on a 
“ Travelling Microscope ” for diameter. 

The first part of the experiment consisted in making the 
examiner obtain 100 determinations of the diameter of one 
and the same disc. T!te result was, as expected, an approximately 
normal distribution of obseivations. The measurements so 
obtained were now grouped into samples of 10, in the order 
in which they were taken, which gives 10 samples of 10 units 
each. There is no apparent reason why these samples of 10 
should differ fi'om one another with regard to their averages 
and standard deviations by more than can be accounted for 
by mere chance fluctuations. But there is always the possibility 
that they do, owing to differences in precision on the part of 
the examiner from one sample to the next (Table 4.1«). 

We know from paragraph 4.3 how to test the hypothesis of 
randomness. , We compare n times the square of the variance 
of the sample means with the average variance within the 
samples. If they are approximately equal we conclude that 
there is no reason to assume the presence of a disturbing cause, 
and conversely, if they are significantly different we suspect 
the presence of such a cause. We thus calculate first the 
average standard deviation of the samples of 10 and square, 
and the standard deviation of the sample means, square, and 
multiply by 10. 

The quantities so obtained are tlien arranged in a form 
peculiar tb the analysis of variance method. Let us recall 
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formula 4-2 where the variance of the total was shown to 
consist of two parts ; variance between sample means and 
variance within samples. We can write this a little more 
explicitly ; 


S(x — x)^ _ S(X' — .r) SS(r — x,) _ ^ ^4 3^ 

N m nm 

where x stands for a measurement, x for the average of the 
total number of measurements (N), ic, for a sample average, 
m for the number of such samples, n for the sample size, S for 
the operation of summing, and SS for summing first all squared 
deviations within a sample of n and then summing for all the 
m samples. 

Multiplying through by N, the total number of observations, 
and considering that N = nm, we obtain 

S(x - = nS(x, ~ x)® 4- SS(k — . • «• 

That is, we have now expressed the fundamental condition 
for our analysis in terms of the Sums of Squares instead of 
squares of standard deviation, s. The formula (4,4) is for the 
purpose of the analysis written in table form as follows : 


Sources 

Sums of 
squares 

Degrees of 
freedom 

Variances 

Between rows . 

— ay 

m — 1 

imy + a/ 

Residual . 

ss{x-a,y 

m(ii — 1 ) 


Total 

s(,x ~xy 

N~1 

a. 


where the " degrees of freedom " are nothing but the number 
of items in each group minus 1, by which the sums of 
squares are to be divided in order to make us arrive at the 
mean sum of squares or v^-riances which are to be compared for 

significance. 

(612) 


7 
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For our experiment the following table resulted (in units of 
.0001 in.). 


Table 4.1 


Sources 

Sums of 
-squares 

Ucgiees of 
freedom 

Variances 

Between samples 

17t 

9 

19-3 

Rejsiclual (within 
samples) 

1724 

90 

19-1 

Total , 

18!I8 



9!) 

19-2 


Table i.Xa gives a simplified process of computation. 

Tabix 4.1« 

To simplify the arithmetic, '2630 was subtracted from the original measure- 
ments, -2035, '2(141, '2641, '2040, -2044, etc., and the following table resulted. 


Sample 

Number 

10 Measurements 

Row 

Totals 

1 

1 

5 

11 

11 

10 

14 

12 

14 

10 

9 

15 1 

111 

2 

13 

H 

10 

4 

10 

8 

7 

18 

16 

17 

119 

3 

S 

IS 

7 

10 

8 

8 

9 

7 

10 

9 

91 

4 

! 1,3 

10 

11 

8 

16 

16 

12 

12 

9 

16 

121 

6 

f 14 

13 

0 

3 

7 

11 

f) 

20 

7 

13 

96 

0 

11 

12 

11 

4 

15 

3 

10 

9 

12 

5 

92 

7 

1 

IS 

7 

15 

0 

8 

15 

2 

13 

14 

117 

8 

[ 10 

14 

9 

« 

8 

9 

11 

10 

8 

G 

93 

9 

1 9 

12 

17 

10 

19 

15 

10 

5 

18 

3 

124 


I 

4 

13 

17 

15 

9 

10 

6 

17 

16 

122 


1 

1086 


Sim/'lified Process of Computation 

(1) Square the Individuals and add : 

6* 4- lU + lU + ... 17 a 4- 16* = 13692 

(2) Square the Row Totals and sum, and divide the result by the number 
of Individuals per row : 

ltl» + ns® + . . . 124» + 1222 119.082 

119-682 


10 


110-6S2 


(3) Obtain the Grand Total for all Individuals, square and divide by the 
total Number of Observations i 

== 11.794. 

loo 
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Then, if (1), (2), (2) stand for the results of the above operations, equation (4.4) 
becomes ; 

[(1) - (3)]i = [(2) - (3)] + [(1) - (2)], 
or 

’ 1898 = 174 -t- 1724, 

which quantities now represent the Sums of Squares for Total, Rows, and Residual! 
respectively, entered in Table 4.1. 


The variances between the samples, within the samples, 
and of the total are very much alike. This confirms the 
assumption that there was no difference in operation as 
regards the examiner’s work from one sample of 10 to the 
next. 

In order to satisfy ourselves that the numerical results of 
the analysis correspond exactly to what was said about its 
logical foundations, let us determine the relevant quantity, 
viz. the standard deviation of the sample means, in two ways. 


i644 ■ 

i642- 

2640 

2638- 

2636 


'/lOOO 

'Ao 

M 

Ao 

'/looo 


Fig. 4.3 

Control Chart for Averages of 10 Measurements taken all on one 
and the same Disc 


First by calculating it as the standard deviation of the means. 
We obtain for its square as seen in row 1 the value of 19'3/10 
= 1'93. And secondly, according to the formula for the 
standard error, which, as seen from row 3 gives the value of 
19-2/10 = 1-92 for its square. It is thus the fact that between 
the sample means there was no greater variation than that due 
to chance only which made us arrive at the result that there 
was no disturbing cause present. 

Using the graphic method of the quality control chart, 
f we plot the averages of our samples of 10 and draw control 
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limits for the averages whose position we calculate from the 
range within the samples of 10. Diagram Fig, 4.3 presents 
the picture of a very satisfactory control. 


4.6 Illustration for Analysis of Variance of Samples Coming from 
Different Populations 

In the second part of the experiment 10 different discs were 
subjected to measurement, and each disc was measured 10 
times. We have thus again 100 measurements as in the 
former case, but this time 10 really different samples of 10 
because they represent the measurements taken on 10 different 
discs. 

In this case we should expect that the variation between the 
samples meaiis to be greater than the variation due only to the 
standard error of the average of repeated measurements. 
Arranging the arithmetic again as in the first part of the 
experiment, viz. first calculating the sums of squares of devia- 
tions within the samples, and then calculating the sums of 
squares of deviations of the sample means and multiplying 
by K, we obtain the quantities of the second column of the 
following table (in units of *0001 in.). 


Tame 4.2 


Source of 
var. 

Sums of 
squares 

Dc'giees of 
freedom 

Variances 

'Var. 

ratio 

Discs (betweeu 
sainpies) 

200 

9 

22-2 

2-06 

' Withiu samples 
(residual) 

070 

00 

10*8 


Total . . 

1170 

99 

11-8 



The simplified process of computation is shown in 
Table 4,2a. 
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Tadll 4.2i2 

For the purpose of simplifying the arithmetic, *2030 in, was subtracted from 
the 4 )riginal measurements, -21)34, -2635, -2034, -2638, etc., and the following 
table resulted. 


Disc. No. 

Sample? of 10 measurements 

Row 

totals 

1 

4 

5 

4 

8 

7 

2 

11 

2 - 

~1 

7 

49 

2 

— 

-1 

5 

18 

n 

5 

4 

0 

(> 

2 

52 

3 

— 

4 

1 

4 

15 

4 

0 

3 

— 

2 

38 

4 

11 

1 

7 

14 

10 

8 

10 

G 

10 

4 

81 

5 

3 

10 

0 

10 

— 

6 

5 

11 

15 

5 

71 

0 

1 

G 

G 

5 

3 

— 

8 

9 

fl 

G 

50 

7 

3 

4 

7 

5 


8 

G 

5 

2 

7 

45 

8 

7 

(i 

10 

8 

9 

5 

13 

4 

7 

7 

70 

9 

G 

1 

;i 

4 

2 

8 

1 

G 

5 

(1 

42 

10 

0 

9 

7 

6 

1 

9 

— 

0 

— 

10 

54 



568 


Simplified Process of Computalion 


(1) Square the Individuals and sura : 


42 + C2 + . . . 6® + IQS = 4283. 

(2) Square the Row Totals and sum, and divide the result by the number 
of Individuals per row : 

349's + 3582 q. . , . 228® + 430“ = 33132 


.33132 

10 


3313-2 


(3) Obtain the Grand Total for all Individuals, square, and divide by the 
total number of observations ; 


6.58= 

100 


= 3113-G4. 


Then, if (1), (2), (3) stand for the results of the above operations, equation (4.4) 
becomes : 

[(1) - (3)] = [(2) -(3)] + [(!)- (2)] 

or 

1170 = 200 + 070 

which quantities now represent the Sums of Squares entered in Table 4.2. 


The degrees of ixcedom are found in the same way as 
in the former case. We have 10 samples of 10, therefore the 
mean sum of squares of the sample mean deviations is obtained 
by dividing through by 10 — 1 = 9. In a sample of 10 there 
are 10 — 1 arbitrary observations, which gives 9 observations. 
Multiplying this figure by the number of sampled we get 90, 
giving the degrees of freedom for the second row. For the 
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total we have 100 — 1 = 99, the sum of the degrees of freedom 
in rows 1 and 2. 

The last column contains the variances which arc to be 
compared. Wc find this time a significant difference between 
the variance of the means and that within the samples of 10. 
The ratio between the two is 2-06, for which we read in the 
tables of the variance ratio,* that a difference such as this 
between two variances would arise less than 6 times in 100 
by chance. Wc thus conclude that this is due to a real difference 
between the sample means. The analysis has thus brought 
out what in this case we knew from the beginning, viz. that 
there were real differences between the samples of 10. 

If we now wanted to obtain the contributions to the total 
variation by the accidental variation and the real variation 
between the measurements, we should have to proceed in the 
following way. 

The variance “between” the samples is made up of two 
parts, viz. 10 times the square of the standard deviation between 
the discs, plus the residual. 

We thus obtain 22-2 = lOco® + from which 
= 1T4 and Ca = 1’07. The standard deviation of the means 
is then made up in the following way ; 


therefore, 1-14 -f- (10’8 4- 10)= 2-22 

the first term showing the contribution of the real, and the 
.second of the chance, variation. 

2*22 is the total standard error of the means, due to fluctua- 
tions of sampling and the real differences. It is, of course, 
nothing else but the variance between samples 22-2 divided 
by the sample size of 1 0. 

On the other hand, the standard deviation of the total should 
also be made up of two parts : the squared standard deviation 
of discs, plus the squared .standard deviation within the samples 
of 10. We obtain from the foregoing ; 


M4 + 10-8 = 11-94 
• Appendix X 
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Tliis is in good harmony with the variance calculated 
from the total of observations "which we find in the last row of 
our table, namely, 11-8. 

In order to represent the foregoing analysis graphically, 
let us again plot the averages and calculate the control limits 
for these averages from the sample ranges. The following chart 
points unmistakably to a lack of control, which would become 
more pronounced as the observation was prolonged. 



Fio. 4.4 


Control Chart for Averages of 10 Measurements, each Sample of 
10 taken from a different Disc 

4.7 Estimatmg the Amount of Variance Contributed to the Total 

Variance by each Component Distribution or Cause — The Slide 

Rule Principle of Probability 

The question now arises as to what use we can make of the 
results of analysis of variance, apart from discovermg whether 
two samples come from tire same population or not. The 
answer is briefly tlris — ^that it allows us to assess the effect of 
each of the disturbing factors, which is a prelinunary for 
removing them, or if we are not able to do this, for using a 
corresponding correction in our final estimate of the quality of 
the work. 

The principle I am going to explain now is one of the most 
important in statistical theory, and at the same time one of 
the most astonishing. 

Let us think of avariable characteristic which is the combined 
effect of a number of contributing causes. To fix our ideas, 
let us speak of a certain dimension (diameter) of the article 
produced on our five-spindle automatic in three slfifts. Let us 
further assume we had established by analysis of variance that 
into the variation of that dimension there enter the causes of : 
difference between spindles, difference between shifts, and, of 
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course, random errors. This means that the dimension in 
question of each individual article will be the result of a com- 
bination of the two assignable causes, plus chance causes. As 
each of the “ causes ” is a chance variable, that is, a varicfble 
which can take on different values with different probability, 
we must think of three frequency distributions compounded 
into one. The best description for what happens in such a case 
is provided by a term sometimes used colloquially in connection 
with the assembly of two types of components : we say that the 
components are being “ married ” to one another. 

Now this is precisely what happens to the distribudons, if 
by “ marrying ” we mean “ mating.” It is not a question of 
merely adding the frequencies of a given value of the variables 
in all three distributions. By no means. Something much more 
complicated happens. Provided that there is no correlation 
between the variables, any value of a chance variable can meet 
with any value of another chance variable. A high value of 
variable 1 will meet sometimes with a high value of variable 2, 
but can equally well meet with a low value or a medium value. 
Even here, among inanimate objects, “ mating ” is not always 
“ matching.” The result of compounding chance variables is 
thus not a summing of frequencies only, but a sort of multiplication 
in thought. Any value of variable 1 has an equal chance of 
meeting with any value of variable 2, which forcibly reminds 
one of the inalienable right of the hereditary units, the genes, 
to combine freely according to the Mendehan Laws of Heredity, 
which are nothing but Laws of Chance. 

But how are we to follow an operation of such complexity 
between our chance variables, and is it conceivable that we 
should be able to predict the properties of the resulting com- 
pounded distribution ? 

The answer is that this has been made possible by a 
method analogous to that which has made multiplication easy. 
This, as is well known, has been achieved by means of logarithms. 
Let me recall the principle of it as it is embodied in the Slide 
Rule. 

If we want to multiply, say, 3 by 4, we set the lower scale 
so as to make its 1 coincide with the 3 of the upper one and then 
read off the upper scale the figure 12 which coincides with the 
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4 on the lower. We have thus merely added two distances 
instead of multiplying 3 by 4, because the scales are logarithmic 
scales, and two numbers are multiplied by adding their 
logarithms and raising their “ base,” 10 as a rule, to a power 
equalling the sum of their separate logs. 

A similar device happens to be available for the compound- 
ing of frequency distributions. If we want to obtain the standard 
deviation of the result of “ marrying,” that is, multiplying in 
thought, or, which is the same, in a random way, a number of 
component distributions, all we have to do is to add their 
variances (squares of standard deviations) and take the squai e 
root of that sum. We thus obtain tlic result of a most com- 
phcated “ multiplication ” simply by summing the squares of 
certain distances epitomizing these distributions. 

We know, therefore, that if a number of distributions of 
independent causes for one and the same characteristic, but 
which are significantly different in the sense explained above, 
are compounded into one single distribution, then the standard 

deviation of that distribution : 

o total— V sum of variances of component distributions (4.6) 

In terms of the Theory of Statistics, the theoretical justification 
for the interpretation given above is as follows (as far as normal 
distributions are concerned). 

The Generating Function G (l) = fp{x)Pdx is transformed into 
the Moment Generating Function by putting ^ = e“. We then 
obtain 

G(e“) ^ fp{x)e’‘’'dx = 1-1- -f ■ ■ ■ 

The logarithm of that function can be expanded in powers of 
a in the form + A2a®/2!-)-Asa®/3!q- . . , (seminvariant generating 
function) where the coeiEcients A,- are called the seminvariants of the 
function p{x). Now since moment generating functions, like simple 
probabilities, are compounded in product, seminvariant generating 
functions are compounded in sum. Hence the theorem : when 
independent systems are compounded, the rth seminvariants of 
the separate systems are added to form the rth seminvariant of the 
compound system. This is called the additive property of the 
seminvariants (see Aitken, Statistical Mathematics, pp, 21, 22). 

Now the second seminvariant is always the square of the standard 
deviation, and, therefore, by the above theorem the variance of the 
compound system is equal to the sum of the variances of the com- 
ponent systems. 
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Further, as for the Normal distribution — and it is more or less 
Normal distributions which arise in industrial work — all semin- 
variants beyond the second are zero, and as the origin in all these 
distributions can be taken at zero, we can conceive of the c<;^m- 
pounding of independent normal systems as being carried out by 
adding the logarithmic expressions for these systems and finding 
the antilogarithm to that sum in the form of the M.G.F. of the 
compound system. 

4.8 Application of this Principle to Industrial Data — Estimating the 
Amount of Variance Contributed by Divers Engineering Causes 

Let us now apply this to the example under consideration. 
Let us call the .standard deviation due to differences in spindles 
Ci, that dne to shifts or machines o„, and that within our 
samples of 10 a,. Furthermore, let us call quite generally the 
number of spindles n^, the number of shifts or machines n, and 
the number of at tides in a sample K 3 . Wc then have from 

formula (4.6) the relation 

Standard deviation of total = . (4.6) 

The question is now how to estimate the a’s on the right- 
hand side of the equation. This is easily done if we bear in 
mind the fundamental principle of the analysis of variance as 
explained above. Supposing we wanted to know whether there 
existed a disturbing factor due to spindle differences. We would 
then calculate the means of the product coming from each 
spindle, the standard deviation between these spindle means 
as if they were individual measurements, and multiply its 
square by n, the number in the sample we were regularly 
taking. The value so obtained we would then compare with 
the square of the average of the standard deviations within 
the samples of n, as described above, and draw our conclusions 
according to whether these two values are sensibly equal (no 
disturbing factor) or significantly different (disturbing factor). 
The first of these values approaches for very great n 

% ^ 

the second 

We write this relation in the form of equations Vx — n^ox'^ + Sa® 

and z/j ~ 

^ . (4 7) 

«s Ka 


and have from them ui® 
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This applies to any number of suspected disturbing factors 
(shifts, machines, etc.) and puts us into a position to determine 
the contribution of each disturbing cause, to the total varia- 
tioh in the product. The sum of variances ((7,-“’s) thus 
obtained, plus the residual representing the variance due to 
random errors, should then be sensibly equal to the variance 
of the total, according to formula (4.6). This puts us in a 
position to describe the relative or percentage amount of 
contribution by each of these factors to the total variation. 

It should be noted, however, that the “ variances ” of 
the table of analysis of variance, the “ v’s,” arc not additive 
in the sense that they would add up to the variance of the 
total, because each " variance ” there (each v^) is an inde- 
pendent estimate of the variance of the total, provided that 
their differences are not significant. 

4.9 How Such Information Can be Used for Improving the Method 
of Control, and, Thereby, for Improving the Quality of the 
Product 

Apart from furnishing important information of that kind, 
we can turn the results of analysis of variance to good use for 
the purpose of controlling quality by means of Charts more 
effectively. 

'With that end in view, a little transformation of the funda- 
mental formula (4.6) becomes necessary. What interests us 
in quality control work is sample averages. As we have seen, 
an average in our example might be affected in three different 
ways. It might deviate from the true mean because of the 
effect of the cause “ spindle.” It might deviate from it as the 
effect of the cause “ shift ” or “ machine.” And it will deviate 
from the true mean because of random errors or fluctuations 
of sampling. 

What we want to know is the standard deviation we must 
expect for sample averages which are affected in this threefold 
way, if the product coming from all machines and spindles is 
compounded into one total distribution and the sample picked 
at random, 

Using the same symbols as before, we know that the standard 
error of the spindle means will be ; 
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that of the disU'ibutioii of the shift, or machine, means : eta 
and that due to fluctuations of sampling : ffg/ ^n.^. 

As the standard error of the mean is nothing else but the 
standard derdation of the distribution of means, we obtain the 
standard error of the compound distribution of means by apply- 
ing the formula (4.5), inserting in place of the standard devia- 
tion 03 of the frequency distribution, the standard deviation of 
the means of that distribution, thus : 

Standard Error of total =v + '^ 2 “ + 

What use can we make of formula 4.5 and 4.8 in Quality 
Control work ? Let us first assume that it is feasible to remove 
the disturbing factors. The engineer, after having been in- 
formed of the results of our statistical investigation, might arrive 
at such a decision. The question now arises as to what to take 
as a basis for calculating our control lines. Clearly, if we 
succeed in eliminating the influence of the disturbing factors, 
we should be able to work to control limits governed by nothing 
but random errors. The residual, ca®, being the square of the 
standard deviation of that random error distribution, thus 
represents the quantity from which to obtain our control limits. 
(Take square root and multiply by factors given in Appendix 
IV(a) or multiply Cg/V^ by 1-96 for the 1/40, and by 3-09 for 
the 1/1000 limit.) 

The second possibility is that nothing can be done about 
removing the disturbing factors. In such a case we must work 
to somewhat wider limits if we wanted to use our, charts as 
efficient control charts at all. 

The limits calculated from the range of our samples of n 
would evidently be too narrow because they leave out the 
increase in variation due to the influence of the disturbing 
factors, and would thus give too many points outside. This 
in turn would necessitate adjusting the machines or spindles 
very often for causes which, according to the hypothesis, it was 
impossible to remove. The net result would he that instead of 
an improvement in the quality of our product, we would only 
achieve a deterioration. But provided the specifications for 
our characteristic do not forbid any extension of the range of 
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measurements, we can from formula (4.8) obtain control lines 
spaced so as to embrace the great majority, if not all, of the 
averages. What wc have to do is to take the standaj'd erroi of 
the^otal as calculated there as the starting-point from which 
to calculate our control limits. (Multiply the standard error 
by 1*96 and 3-00 and add ot subtract fi'om average of averages.) 


4.10 Detection of Variations in Setting 

As an illustration for the use of formula 4.8, let us return 
to the operation discussed in Chapter II, viz. the presswork 
operation of setting percussion caps to a specified depth. Wc 
shall see that formula 4.8 enables us to arrive at a physical 
interpretation of gicat practical interest of the difference 
between the total variance of sample means and the variance 
from which the control limits are obtained. 

If the dimension which is to be controlled depends, as it 
does in this case, upon the correct setting of the machine, then 
it is clear that the extent of deviation from the specification 
mean will depend, not only upon the chance deviations from 
that mean, but also on the setter’s ability to keep close to the 
specification or drawing average, and on the influence of tool 
wear which might cause the setting to drift in one direction 
instead of remaining fixed. 

In brief, the settings themselves will have a frequency dis- 
tribution around the specification mean. This distribution 
we must imagine to be superimposed upon that of the chance 
variations of the product around a given setting, the former- 
having its source in the machine plus servicing, the latter in the 
quality of the components which are to be assembled plus the 
operation of the machine. 

Assuming the setting to remain fairly constant while small 
samples of, say, five units are taken, which is equivalent to assum- 
ing that the range is under control, then the variation within 
such samples transformed into that of the population of sample 

means according to the formula will be a good measure 

u„Vn 


of the variability of mcarrs due to the quality of components 


and the operation of tire machine. 


s/fi 


is, of course, nothing 
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else but the standard error of the mean in a normal population 
which, multiplied by 1.96 and 3-09, gives our 1/40 and 1/1000 
limits respectively. We can also describe it as that part of the 
standard deviation of means which is due to chance. ^ 

Now from paragraph 4.8 we know that according to the 
slide-rule principle of probability, the total variance of sample 
means (that is, the sc[uare of the standard eiTor of means) 
equals the sum of the variances of the component distributions 
of means. In our case (see formula (4.8)). 

St. Error^ (tot.) = St. E.“ (chance dev.) -h St. E.^^ (settings) 
1 1 follows that the dilferencc 

St. E.-^ (tot.) - 

will give the variance of sample means due to setting (the range 
assumed to be under control). This, then, is the physical inter- 
pretation of the difference between the total variance of sample 
means and the variance upon which the control limits arc 
based. 

We obtain this quantity either by an analysis of variance 
calculation of the type carried out above (paragraphs 4.6, 4.6), 
or directly from the control chart for averages by determining 
the standard deviation of all the points plotted (for a rough 
approximation we could divide the total range of plotted points 
by 6), squaring, and substracting from it the square of 1/6 of 
the distance between the 1/1000 control limits. 

The magnitude of the quantity so obtained, relative to the 
total variance, is a measure of the extent to which setting 
variations interfere tvith the uniformity of our product. If it 
is considerable, and if tire setting variations cannot be reduced 
satisfactorily, we must allow for them in setting our control 
limits by increasing the distance due only to chance by an 
appropriate amount. 

To give an idea of the extension of control limits for 
averages in such cases, it may sufBce to say that very often a 
distance (of the 1/1000 limits from the mean) of four instead 
of three times the standard deviation of averages will be found 
necessary for accommodating practically all sample averages. 
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4.11 Multifactorial Design 

In paragraphs 4.5 and 4.6 the analysis of variance method 
was used for determining the dependence (or otherwise) of a 
certain effect upon the variation in one cause. Very often, 
however, the question arises of whether a given effect is de- 
pendent upon a number of causes, and also, how such causes 
are interlocked with one another in producing the effect. 

The analysis of variance method, altlrough perfectly able 
to provide a complete answer to problems of this kind — in fact, 
it is then that it shows its full power — becomes rather com- 
plicated in such cases. Performing a difficult analysis without 
mastery of the analytical relations involved, brings with it the 
risk that it miglit degenerate into a mechanical procedure 
whose working is a mystery to the peison applying it. 

The difficulties of carrying out a complex, that is multi- 
factorial, analysis are partly due to the usual method of doing 
the various parts of the analysis as far as possible simultaneously, 
for the sake of economising in arithmetical labour. Since, 
however, sacrifice of economy in arithmetical work is preferable 
to sacrifice of understanding, a slight modification of the 
original method. is set forth in tliis paragraph, which, by taking 
out the effect of one cause at a time, and tlius proceeding step by 
step in the logical order of what we have made it our aim to 
achieve, demands at each step nothing more than the simple 
analysis of variance explained in paragraphs 4.5 and 4.6. It 
has thus the effect of resolving the multi-variate case into a 
sequence of univariate ones. 

An experiment was carried out with a view to testing the 
protection against corrosion of steel afforded by coating treat- 
ments of the steel (“ undercoats ”) and paint systems. Five 
different treatments were given to equal numbers of steel panels, 
which were further protected by five different systems of paint, 
the number of panels for each combination of a given treatment 
with a given paint system being the same. These panels were 
then subjected to weathering influence (natural or artificial) 
and, after a certain time, the “ amount of corrosion ” ascer- 
tained, as epitomized by the area and the depth of the corrosion 
spots occurring on a panel. The " marks ” which the panels 
thus received were averaged for the panels belonging to the 
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same combination of treatment and paint system. In the 
following Table 4.3, the rows denote the five treatments, thc' 
columns the different paint systems, whereas the entries in the 
body represent the “ amounts of corrosion ” for each row 
column combination, in teims of the adopted system for 
evaluating the results. 

Table -t.S 


Treat- 








nif*nt 

0 

1 

■2 

3 

4 

.Sums 

Means 

Paint 

system 








0 

2.') 

1.-) 

lit 

Ui 

13 

82 

10 4 

1 

20 

10 

17 

13 

10 

70 

15-2 

2 

1!) 

111 

11 

13 

0 

71 

14 2 

3 

IS 

11 

It 

a 

7 

01 

13'2 

4 

a 

V 

11 

4 

10 

44 

8'8 




06 


40 






13-2 


9 8 




We will start with ascertaining whether the dififerences in 
the amount of coiTosion, as we proceed from one treatment to 
the next, point to significant differences in those causes, in 
other words, whether we can rely on certain treatments giving 
better protection than others. 

The following arrangement of the arithmetic is only 
another expression of that set forth in Table i,la, but it is 
to be recommended in the case of multi-variate analysis 
because it is less liable to confusion. The steps are : 

(1) Form the sum of squares of all entries, namely 4984. 

(2) Form the sum of the five products : row sum by row mean, 
4639-6 in our case. 

(3) Each of these sums must be corrected for transference to 
the general mean by subtracting the product ; total sum 
by total mean, 4462-24. 

This gives the equation (1) — (3) = [(2) — (»3)] 

+ [(1) *" (2)]) or, numerically 
521-76 = 177-36 -f- 344-4 


. . m 
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where the first term is the sum of squared deviations from the 
general mean, the second represents the variation “ between ” 
rows, and thus is the sum of the squared deviations of the row- 
meS.n3 from the general mean, and the third is the sum of the 
squared deviations from the row-means, and thus the variation 
“ within ” rows. . 

This represents a partial result which cannot be interpreted 
in physical terms until the other variable causes which we wish 
to consider have been dealt with. 

In order to ascertain whether we can rely on certain paint 
systems to give better protection than others, we must now 
analyse the data with regard to columns. 

We first subtract from each entry in Table 1.3 the corre- 
sponding row-mean, and obtain the following Table 4.4 : 


Table i.4 



We then treat this table in the same way as the original 
table — ^that is, wc deal with its entries as if they were the 
original ones — ^with the only difference that we now determine 
the variation between columns instead of between rows. 
(Proceed according to the steps given above, substituting 
" column ” for “ row.” Since Ae overall mean of Table 4.4 
is zero, step three leaves the sums of squares unchanged.) 

This leads to the equation 

344-4 = 220-9 -f 123-5 

(m) 


, (4.10) 
8 
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Comparing (4.10 with 4.9), wc see that what we have done in 
the second part of our analysis was to resolve the variation 
“ within ” or the residual of (4.9) into two parts, viz. the 
contribution of the column effect and another (reduced) 
residual. 

Combining both equations, wc have 

521-70 = 177-36 + 220-9 + 123-5 . . . (4.11) 

Let A, B, G, D, E, in the following “ Latin Square ” 
represent levels of a third cause, say live different types of steel 
used for the panels : 

ABODE 
E C A B D 

B D E G A 

D E B A G 

G A D E B 

Summing and averaging the five entries in Table 4.3 corre- 
sponding to each of these letters, we now proceed exactly as if 
they were row or column sums and averages. We first subtract 
the column means from the entries in Table 4.4, which gives 
us a further reduced table to which to apply the simple analysis 
of variance procedure according to the steps set forth above, 
substituting “ letter-mean or -sum ” for row-mean or -sum. 

The result would be an equation where the last residual of 
123-5 would be split up into the “ letter ” — that is, the type of 
steel — contribution and a final residual. 

Taking, however, only two causes into account, ti-eatment 
and paint system, we can now set up the following analysis of 
variance table from equation (4.11) : 


Source 




Prob. 

Treaiment 

I7T-36 

4 

44-3 

<•01 

Paint 

220-8 

4 

53-2 

^•01 

Residual 

123-tt 

le 

7-7 


Tswal 

B2I-76 

24 
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In the last column we see that the difi'ei’ences in the degree 
of corrosion between panels subjected to cither different treat- 
ments or different paint systems are such as would occur on 
pure chance less tlran once in a hundred. Neglecting such 
small probabilities, we conclude that the various treatments 
and paint systems really differ with regard to the degree of 
protection which they afford against corrosion. 


For the reader interested in the mathematical side of the matter 
I give below the analytical structure of the modified method. 

Denoting the single observation possibly depending upon two 
causes by j).„ each subscript standing for one of those causes, the 
respective averages by y, and ji, and the grand average by y, the 
number of rows by h and that of columns by Ay we can write for the 
Standard Deviation squared : 


I 


A - - -jq— 


\ N / (See Appendix II) 


= . . .(4.12) 


In words : Total sums of scjuares = sums of squares of 
entries Table 4.3 — Prod. (Total Mean by Total Sum). 

This represents the combination of Steps 1 and 3. 


(b) 2S(y „ -yy = i:2(y„ -y,y kE(y, -y^ 

(See equation (4.4)) 

Xi:(y.,--y)^-i-L(y-yy = SS(v._,-y,y . . . .(4.13) 
Now since 

S(i. -i)^ = - (^') 7 ^ 

= 

Letting the letter h stand for the number of rows and k for the 
number of columns, this means in words : 

Sum of squares “ between rows ” == sum of products (Row 
Sum by Row Mean) — product (Total Sum by Total Mean). 
This represents the combination of Steps 2 and 3. 

11 According to equation (4,13), subtracting (4.14) from (4.12) 
is equivalent to forming the quantity SS{yj/— -Ji)* or the 
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residual. We obtain this quantity numerically from Table 
4,3, by subtracting liom each entry the corresponding row- 
mean, squaring, and summing over all entries. 

Repeating now for the columns (subscript j) what we have 
done for rows (I), and remembering that our basic quantities 
are now the (ji„ — j,) differences, we have (A denoting the 
number of rows) ; 

-ir - AS a -jy- = -yr • • 

or 

In words : 

Sum of Squared Entries (Table 4.4) — Sum of Prod. (Gol. 
Mean t;y Gol. Sum) — (Tot. Sum by Tot. Mean) Residual, 
We see that the result of the operation is to reduce the 
entries in 'I'able 4.4 still further by the corresponding 
Column Means, 

Carrying out Parts I and II simultaneously, we write 
equation (iva) as 


S(;-, ^yy-kH (y, ~y) * - hm -y) ^ = SS {y, -y, -y ,) "■ 
or 2S;;,/ - (h^y.^ - 

= m{y.,-y.~y,)^ (4.16) 


In words : 

Sum of Squares — (Total Sum by Total Mean) — [Sum of 
Prod. (Row Sum by Row Mean) — (Tot. Sum by Tot. 
Mean)] — [Sum of Prod. (Gol. Sum by Gol. Mean) 
— (Tot. Sura by Tot. Mean)] = Residual. 


The process could now be repeated for a third variable 
cause, described by a letter subscript L, leading to the 
equation 


~y)^ ~ k{y, —y)^ - —yy-l{^j. -yy 

» SSS(^„, -i, ~y, -J^y (4.17) 

and, indeed, for any number of variables, each step implying 
a further reduction of the original table. We can thus 
conceive of each observation as being made up in the 
following way : 

JijL — i. +yi +/£ -f Residual 


(4.18) 



Chapter V 


THE COMPASS OF CHANCE— CORRELATION * 


5.1 Trends — The Relation Between Trend and Significant Differences 
All we have been concerned with hitherto is the distribution 
of measurements, sample averages, and ranges, of the variable 
represented by the vertical scale of the quality control chart. 
The forna of the distribution along the horizontal scale 
which represents the order of our samples according to the 
method of sub-grouping did not concern us, or rather was left 
out of account, except for the brief mentioning of Trends, 
paragraph 2.12. It is time we devoted some attention to that 
second aspect of the plotted chart. 

It is not a very rare occurrence in practical work to get 
charts showing the following picture : 


(a) 


(b) 



X +'/iooo 
X 

MEAN 

■ X ■■ ■■ ■“ 

* 7 

-ViOOQ 


X X 




X 

X 

- X— 




X 

Fig. 5.1 

Part of Control Chart for Averages showing Trends 


Translating the picture into words, the higher the ordinal 
number of our sample the greater the value of the average of 
the sample (a) or vice versa (^i). 

Clearly this would indicate a departure from randomness. 
Some regularity has come in which disturbs the free play of 
chance. 

But the assignable cause we meet here acts in a somewhat 

♦ For a thorough exposition of the Theory of Correlation, the reader is referred 
te Chapters 11-16 of the htrodwtion ia the Theory of Statistks, G. Udrg Tule and 
M. G, Kendall, 11th edition, 1937. 
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different way frona that accountable for by a disturbing factor 
in general. 

First, it is only relative to the sub-groups we have chosen 
that we can speak of lack of randomness. We need not findlhe 
disturbance absolutely, that is, without reference to a second 
qualitatively different variable. 

Secondly, it is not only a disturbing factor but one which, 
although it disturbs randomness, introduces a peculiar kind of 
regularity and order. As such it admits of a new kind of 
“ control ” as we .shall sec presently. 

The ascertaining of a trend on our chart is of just as much 
importance as tliat of a state of control, or lack of it, in the 
usual sense of the term. It is closely connected with the latter 
because if the trend is allowed to continue it is inevitable 
that the averages should soon fall outside the control lines, and 
lack of stability would be the consequence. Charts should, 
therefore, be closely watched for indications of trend. 

If an unmistakable trend occurs, it is a matter for the 
production engineer (if the sub-grouping was according to a 
technical criterion, say, different spindles or machines) or the 
administrative officer (if the sub-grouping was according to 
a labour criterion, say, shifts) to do everything in his power to 
eliminate such disturbance. 

The relation between our former method for ascertaining 
disturbing factors — analysis of variance — and this method — 
trend — ^can be formulated as follows ; 

For a trend to arise it is necessary that the sample means 
should differ by more than they are supposed to do if all the 
samples came from one and the same population. This is also 
the necessary condition for the presence of a disturbing factor 
in general or for lack of control. 

But whereas such condition is also sufficient in the general 
case, it is not sufficient for the emergence of a real trend. 
Such a phenomenon arises only if the successive significantly 
different sample means form, on the whole, an increasing or 
decreasing series. ^ 

In other words, when a disturbing factor is ascertained by 
j^nalysis of variance, there is not always a trend. But wherever 
there is a trend there is also a significant result according to 
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analysis of variance, provided the sub-grouping is according 
to the same variable. 

5.2 The Use of Inclined Control Linjits 

It seems evident that, as a general rule, tire variable accord- 
ing to which we sub-group should not be correlated with the 
characteristic which we wish to control. This is one of the 
conditions under which we can expect the sample averages to 
come within the control limits. 

But there are conceivable cases in which the only sensible 
way of sub-grouping is according to a variable which is corre- 
lated with the characterisdc for wliich we sample, apart from 
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Fig. 5.2 

Control Chart (usual Type) for Average Velocity of Rounds fired 
from one Barrel 

those cases where we are directly and primarily interested in 
the way in which the variables influence one another. 

To fix our ideas, let us think of successive batches of am- 
munition which are fired from one and the same barrel. 
The condition of the barrel will slightly alter from one batch 
to the next, if not from one round to the next. By how much 
it will change depends on the quality of the barrel steel and 
on the quality of the ammunition. Even if the barrel material 
is as it should be according to the specifications laid down for 
barrel steel, the changing “ barrel life,” that is, the ageing of 
the barrel with every new batch of ammunition fired from it, 
win influence the functioning of the ammunition, namely, its 
velocity and pressure. We could evade the effect of barrel 
life ” on these functioning characteristics of the rounds of 
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ammunition by changing the barrel. But this would introduce 
a new disturbing factor due to possible differences in barrels. 
It is therefore advisable, if no other considerations come into 
play, to fire from one baiTel.but to take the influence of barrel 
life properly into account. 

Let us assume that successive batches of 200 rounds each 
of a certain type of ammunition have been Bred. If we now 
wish to control the quality of the ammunition with regard to 
its velocity, we can set up for this purpose a quality control 
chart of the usual kind (Fig. 5.2). 

We see at once that the conti'ol chart idea in the form 



Fig. 5.3 

Control Chart (with Inclined Limits) for Average Velocity of 
Rounds fired from one Barrel 


which is familiar to us does not fit the problem. Not taking 
into account barrel life, we would firom the chart conclude 
that the quality of the ammunition as indicated by the velocity 
was not under control. This conclusion, however, would be 
Wrong if the variation in the velocity which we observe were 
due, not so much to significant differences in quality of the 
ammunition, but to the influence of wear of barrel. 

What is wanted, therefore, is clearly a quality control 
chart so designed as to take properly into account the effect 
of the assignable cause of barrel life. A chart on which the 
grand average line was not horizontal but somewhat inclined 
towards the horizontal would evidently meet this requirement. 
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On either side of that line control limits could be drawn 
within which all sample averages could be expected to lie if 
the quality of the ammunition was under control. A chart of 
thalt kind would represent a picture like the following : 

The problem with which we are faced when trying to 
construct a chart like 5.3 is obviously this : Are there legitimate 
methods available for 

(а) determining the direction of the average level ; and 

(б) obtaining control limits which could serve a purpose 
analogous to that of the 1/40 and 1/1000 limits on the 
ordinary control chart ? 

The answer is that such methods have been developed in 
what is called Correlation Analysis. 

5-3 Correlation Between True Variables — The Correlation Table 
The trend is an instance of the relation between statistical 
vadables known as Correlation. 

The trend, however, formed by succes.sive sample averages 
if the samples are taken periodically cither with regard to 
time or quantity of output, is not the typical case of a corre- 
lation between two variables. We have here one genuine 
variable — ^vertical scale — and one artificial — horizontal scale. 
The latter is only a quasi-variable, in which the series of 
ordinal numbers is used for the purpose of distinguishing 
between samples. The samples are thus ranked, not measured. 

In the typical case, however, both variables are ti'ue chance 
variables, that is, they can take on different values with dif- 
ferent probability. It is here that the correlation method 
unfolds its full power. 

Let us think of a machine operation by which cartridge 
cases are loaded with propellant. What is essential for the 
functioning of the round is the amount of propellant in the 
cartridge case as measured by its weight. But the construction 
of the loading machine is such aS to load a certain volume. 
Considering, now, that the propellant consists either of small 
flakes or cylinders, it is clear diat one and the same weight 
can be more or less closely packed and could thus give slightly 
different volumes, and vice versa. It is evident that the correct 
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powder charge will in such a case depend on the intensity of 
correlation between the volume loaded and its weight. 

The diameter of the column of propellant being constant, 
it is, strictly speaking, the height of filling which is direbtly 
controlled by setting the machine. The weight, on the other 
hand, is only indirectly controlled by the intensity of correlation 
between the height of filling and the weight. It is therefore 
essential for the puqjose of the whole operation to know to 
what extent such dependence exists. This is found in the 
following way ; 

A number of filled cases arc picked at random and measured 
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according to both characteristics — -weight and height of filling. 
These measurements are then recorded in the form of a Corre- 
lation Table, where each individual case is entered according 
to its value in both variables. After having measured, say, 100 
fillings, the table could present the picture Fig. 6.4 on page 110. 

5.4. The Correlation Diagram 

There are various methods of determining the intensity of 
association between two variables. We shall start with the 
simplest — the geometric method. 

The column on the right side of the table gives the mean 
height for a given class interval of weight, and the bottom line 
the mean weight for a given class interval of height. We see 
at a glance that on the whole an increase in height is accom- 
panied by an increase in weight, and vice versa. In such a 
case we speak of a positive correlation between the variables. 
If increase in one variable was accompanied by a decrease in 
the other, it would be a case of a negative correlation. The 
absence of any regularity whatsoever, or the same mean, 
height for all the values of weight (and vice versa), would 
point to absence of correlation. 

To represent correlation graphically we mark on the 
horizontal axis the class intervals for, say, height, and erect 
in the centre of each inteiwal an ordinate whose length is 
proportional to the mean weight in that interval. The individual 
weights on which the mean is based are also entered in the 
following diagram (Fig. 6.6). The scale in this simplest method 
of graphic representation is quite arbitrary. 

We find that the points so obtained can be fitted satis- 
factorily by a straight line, called the Regression Line. This 
line, which corresponds exactly to the trend on our charts, 
indicates the presence of correlation. If it ascends to the right 
it points to a positive, if to the left to a negative. A horizontal 
line would mean the absence of correlation. 

5.5 Galtoa’s Geometrical Method for Determining the Intensity of 
Correlation Between Tt^o Variables 

But this method is very primitive. As the scale for both 
variables is entirely arbitrary we could give our line any 
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Fig. 5.5 
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inclination we choose. We could make it steeper or flatter by 
appropriate changes in the scales. That is why the direction 
of the line cannot be safely used as a measure of the intensity 
of correlation. The first method as it stands cannot, therefore, 
be used for estimating the correlation coefficient which serves 
as a measure of the intensity of correlation. 

If we Wanted to do this geometrically we must standaidize 
the values of the deviations in both variables. We do this by 
expressing them as multiples of their respective standard 
deviations, and therefore as pure numbers, with the result of 
making them compai-able with one another. 

Standardization means dividing by the standard deviation. 
We divide, say, the class interval centres of height by cj^ and 
get the equidistant scale on the horizontal axis. Then we 
divide the averages of weight in each class interval of height 
by and get our vertical ordinate. 

The intersections of the verticals and horizontals erected 
at corresponding points of height and weight will again lie 
approximately on a straight line. But this time the inclination 
of the line is a reliable measure of the amount of correlation. 
We find the correlation coefficient as the ratio of the ordinate 
of a point on the regression line to its abscissa, or as the slope 
of the regression line (Fig. 5.6). 

For our example we find that the correlation equals 0-66 
approximately. 

Just as we chose FI as the independent variable, we could 
have chosen W. The vertical ordinates would then represent 
the centres of the standardized class intervals of W, and the 
horizontal ordinates the standardized averages of H belonging 
to each interval of W. 

Proceeding in the same way as before, we obtain again a 
correlation line, but as a rule .not the same line as before. 
What we have done at first is called " determining the regression 
of W on H ” ; what we are doing now is to determine the 
regression of H on W. The lines are consequently called 
regression lines of W on H, or H on W, respectively. 

Now it is the angle formed by these lines, or rather its 
cosim, which is a measure of the association between tire two 
variables. This allows us in a simple way to find out aU we 
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•want about the limiting values of the correlation co-efficient 
in general. 

The cos of 90° is zero ; consequently, if the lines are at 
right angles, which means that to any value of H there corre- 
sponds always the same mean value of W, and vice versa, 
there is no correlation. 

The cos of 0° is 1. Consequently if the lines coincide, 
which means that to a specified value of H there corresponds 
only one value of W, and vice versa, the correlation is perfect ; 
it has become a functional relationship. 

The cos of any intermediate angle between 90° and 0° can 
take values between 0 and 1. Therefore the correlation co- 
efficient in general can take values between 0 and 1, and in 
the case of negative correlation between 0 and — l. 

As both regression lines must necessarfiy lie in the same 
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quadrant, we can easily see what position will indicate the 
maximum positive correlation, namely 1. Let us assume the 
angle between the two regression lines, whose position with 
regard to their respective independent axis is always similar, 
is anything between 0° and 90°. Now let us move both lines 
towards one another until they coincide, keeping their position 
with regard to their respective axes similar. They will then 
coincide when the angle of inclination of each line towards 
its axis is 45°. This position of a regression line halving the 
quadrant then indicates the maximum of correlation. 

The corresponding line for perfect negative correlation is 
inclined under 135° towards the axis of the independent 
variable. Any positive correlation less than i must be repre- 
sented by a regression line whose angle of inclination towards 
the axis of the independent variable is less than 45°. 


5.6 The Correlation Coefficient Computed by the Product-Moment 
Formula — The Regression Equation 

But any graphic method has its mathematical shortcomings. 
If it is a question of obtaining a correct measure of the intensity 
of correlation between two variables, an arithmetical method 
is recommended. 

The algebraic formula for the correlation coefficient is the 
Product-Moment formula, or Bravais formula : 

.._s/DA 




(S.lfl) 


where /= Frequencies 

D = Deviation from resp. Means 
S = operation of Summing 


which in the case of the deviations being taken from an arbitrary 
mean becomes 




(5.1i) 


where = deviations from re.sp. arbitrary values A^, 

(For meaning of b see Appendix II) 

As an illustration let us use the data of Table 5,4, The following 
diagram (5.7) and paragraphs exltibit the proceclure for em- 
ploying tlie Product Moment formula step by step. 
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(Step 1) It is very advantageous to express both the deviations 
from A and the standard deviations as multiples of the class 
intervals, because this reduces the amount of computation 
considerably. ^ 

Care must be taken that all quantities in the formula for r 
are expressed in the same units ; either all as actual deviations 
from A„ Aj,, or all as multiples of class intervals. 

VVe now divide the table by the cross through A^ and A^ 
into four quadrants, and mark the deviations as multiples of 



Correlation Table adjusted for Calculating the Correlation Coefficient 

Notes 

(1) Tile outer scales represent the deviations from the arbitrary Means, 
A„A„, in multiples of the Class Intervals. 

{%) The figure in the lower right-hand corner of a square represents the pro- 
duct of the deviations s„. and from and A^ respectively. 

(3) Tlie last Columtls contain the positive and negative products fUaUvi whose 
algebraic sum gives the tern| 
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class intervals. A product of the form/fl,fl„ is then clearly the 
product of the deviations of a given square from both A^, and 
times the frequency (/) entered in the square. 

(Step 2) In this way the algebraic sum of all these products is 
formed, and care should be taken to give each product the 
proper sign ; the right-hand upper quadrant and the left-hand 
lower quadrant being positive, the other two negative. The 
products within the cross are, of course, all zero, the A„ Ay 
being zero. We thus obtain 148 for 

We now turn to the quantity in the denominator of (6,lZi) 
(see Appendix I for calculation of M,a). 

(Step 3) Height. 

M„ = Ah + {bn X 10 *) == 14, t 10 J 

(Uniti : 

= 14.') -I- { - -1 X 10} = 145 - 1 = 144 -01 of an 

inch) 

Standard Deviation - ij) x 10 

= {\/2-72 - -01} X 10 = l-C.'S X 10 = lC-5 (in -Ol’s of an inch) 

(Step 4) Weight. 

Mv. = Aw + (5w X -2 *) = 30-55 + f|- S(Deviationj ^ om A Qj ^ J 

= 36-55 + (-55 X -2) = 36-55 + -11 = 36-06 grains 

StandardDeviationow ^ 

= {^2-31 - -30} X -2 = 1-41 X -2 = -28 grains 

(Step 6) Combining the results of steps 2, 3, 4 we have 

raw = S (Deviations from Mu x Deviations from Mw) 

lUOanCTii 

— S(Dev, from Ah ^ Dev. from A\y) ■— 100 ftu&vv formula 6,li and 

148 - (100 X — -1 X -55) _ 153-6 
““ 100 X ^>66 X 1-41 232-06““ m 

+ Class interval Siae 

(Oia) t «; 


9 
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Knowing how to calculate the correlation coefficient we are 
now in a position to substitute for the somewhat crude method 
of drawing the best-fitting line to our scatter of points simply 
firee-hand a more ambitious one. We can calculate its posi- 
tion and, knowing the correlation coefficient, the way for doing 
this has been made veiy smooth indeed. 

If r is the correlation coefficient between height and weight, 
and We want the regression line of W on H, all we have to do 
is this : 

We form the quantity and write the equations for the 

regression line of W on H as 

(W - M,,) =- rSs: (H - MJ, . . . (6.2) 

where r is the correlation coefficient, ci„., have their usual 
meaning as standard deviations of W and H, and M„ stand 
for the respective mean values, and where W stands for the mean 
value of the W-airay belonging to a given H. 

By substituting successive values_of H (class interval centres) 
we obtain corresponding values of W and can plot the regression 
line as the line through such co-ordinated pairs of values of 
H and W (Fig. 5.8). 




THE COMPASS OF CHANCE I IQ 

A similar equation, and regression line, may be obtained 
giving H in terms of W (Regression of H on W) : 

(H — Mjj) = (W — M,y) (see Fig. Q.Ba). 

^ 0W 

This provides the answer to Part I of the problem stated at 
the end of paragraph 6.2. 

5.7 The Part of the Variance of the Dependent Variable which 

Remains Unexplained by Correlation 

We now understand enough of the mechanism of cojTelation 
to tackle the problem of how to make use of it for the purpose of 
obtaining control limits. (Part II of the problem stated at the 
end of paragraph 5.4.) 

It can easily be shown that if the measure of correlation 
between two variables is R, then the part of the total variance 
of the variable Y, say, that is accounted for, or explained by, 
the variation of the other variable, X, is the square of the 
standard deviation of Y, a,,® multiplied by R^. In other words, 
it is the fraction of given by Oy^R**. R^ must always be a 
fraction unless R = zero (no correlation) or R = 1 (functional 
relationship) . 

The total variance of Y, will thus consist of two parts 
= cjy^R^ + Remainder .... (5.2) 

The “ Remainder ” term is the part of the variance of Y 
which is left unexplained by Y’s relation to X. There are now 
two principal measures of correlation, which we call the 
Correlation Ratio (^) and the Correlation Co-efficient (r). 
According to which of these we substitute for the R in formula 
(5.2), the “ remainder ” term will take on a different meaning. 

5.8 The Remainder Term in the Case of the Correlation Ratio (7) 

Let us .first look at Fig. 5.5. The points in a given column 
represent the values of Y (Weight) belonging to a given value of 
X (Height), It is evident that the smaller the range of the Y’s 
within these columns, the surer can a change in X be taken to 
import a change in Y. The more wider spaced they are, that is, 
the greater the range within a column, the more will they over- 
lap from one column to the next or even the next but one, etc., 
and the less reliably does a change in X carry with it a corre- 
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sponding change in Y. Each column has its own frequency 
distribution, and the degree of variability within these partial 
distributions determines the intensity of correlation between the 
variables. ^ 

Now the most reliable measure for the range of variation is 
the standard deviation. If we calculate that parameter of dis- 
persion for each of the columns, then the average of all the 
squares of these partial standard deviations will give a depend- 
able measure of the extent to which a change in X is accom- 
panied by a change in Y, or to what extent Y is controlled by 
X. The greater the average square of the standard deviation 
the less can we rely upon a change in X producing a corre- 
sponding change in Y, and vice versa. 

According to this interpretation, the average square of the 
partial standard deviations, or to be exact, the average of 
the squared deviations of all observations from their respective 
Array Means, should be equal to what we have called above 
the “ remainder ” term in. equation (5.2). And this is actually 
the case. It is the part of the variance of Y which is 
left unaccounted for by its correlation with X. Calling 
the (partial) deviations of the Y’s within a column from 
their respective sub-means, Z, we have, forming one total 
distribution of all Z’s, 

cT/“ = (5.3fl) 

and <7/ = a/ (1 -R^') (5.36) 

The ratio — = -^/i T. Rs (5.3c) 

is a measure of the extent to which Y is not dependent on X and 
is called the Coefficient of Alienation. 

The quantity R ~ (6.3d) 

^ 1 / 

is called the Correlation Ratio, and is usually denoted by ■>]. 
It measures the intensity of correlation between X and Y, no 
matter what the form of the relationship. 

Froffi the above formulae we see that for == 0, = 1, 
and for a, ~ -ij = 0. 

, In words : If the standard deviation within the columns is 
zero, Tj must be 1 because to a given value of X there corre- 
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spends then only one value of Y. If equals the standard 
deviation of Y, rj must be zero, because for a given value of 
X there would be the whole range of Y’s to choose from. 

5.9 Translation Into the Control Chart Picture 

Let us now translate all this into the familiar form of the 
control chart. 



HEIGHT 
Fig. 5.9 

Control Chart for Weight using Sub-groups of Height 


On. the horizontal scale are marked the class intervals of 
the variable height, on the vertical scale those of the variable 
weight. W e then plot the averages of W from each column 
of Table 6.6 against the proper class interval value of H. The 
control limits are obtained from the average range or standard 
deviation within the columns. 

We see that the averages show a definite trend upwards, 
indicating a shifting mean- The correlation ratio can he taken 
as a measure of the extent to which a trend might be expected, 
and therefore indicates to what extent we must expect our 
product to be out of statistical control. In our case it indicates 
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to what extent weight is not under statistical control if we sub- 
group according to class intervals of height. (This does not 
mean, however, that it must appear out of control if the sub- 
grouping was done according to other variables, “ time,’Vfor 
instance ; see paragraph 6.1.) 

This means that the usual type of control chart can in such 
cases only give an indication of the lack of control, but cannot 
effectively be used for controlling the characteristic as the 
term is used in quality control work. 

5.10 The Rcmamder Term in the Case of the Correlation CoefHcient (r) 

The interpretation of the “ remainder ” term of formula 
(5,3) as given above, and that of the measure of correlation, 97 , 
was based upon the standard deviation of the arrays of the 
dependent variable. There is, however, another possible 
conception of the “ remainder ” which yields a measure of 
correlation somewhat different from the correlation ratio. It 
is called the Correlation Coefficient, r, and is the measure most 
commonly used. This measure can best be explained from the 
regression diagram, Fig. 5.5, 

Looking at that diagram, we see that the individual values 
of Y belonging to a given value of X do not lie exactly on the 
regression line, but are more or less scattered around it. The 
closer they arc to the line, the surer could predictions be made 
that a change in X would be followed by a change in Y ; the 
wider their scatter around the line, the less reliably will a 
change in X purport a change in Y. 

Thus the intensity of correlation will in this case depend 
upon the scatter of the Y's, not exactly around their own 
sub-means as before, but around the regression line. IJsing 
the standard deviation of the deviations of these points — 
measured along the Y axis — from the regression line as the 
best measure of dispersion, we arrive at an interpretation of the 

remainder ” as the variance (square of standard deviation) 
of the deviations of the Y’s from the regression line, measured 
on the Y axis. Calling these deviations residuals, Z', we now 
define the remainder as the variance of the residuals Z'. 
As the quantities %' will as a rule be more or less different 
from the quantities Z, and, therefore, different from a,, 
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we must expect the fraction in equations (5.3) to have here 
a somewhat different meaning from This is actually the 
case. The quantity which now takes the place of i? is the 
Coeflficient of Correlation, r, which wc have already met in 
paragraph 5. 6, where the working formula for it was given. It 
is a measure of the linear dependence between two variables. 

The form of equations 5.3a to 5.3d remains unchanged. 
Thus 


Cj," ^ . . . . 

. . . (6.4fl) 


. . . (6.4i) 

= 1 - 

. . , (5.4c) 



.V.2 / . “ 


\/ 1 - , 

‘ ^ (T 

'*1/ 

■ • • • (5.4d) 


5. II A Test for Linearity of Regression 

The comparison of 17 with r for one and the same bi-variate 
universe allows us to draw a conclusion as to whether the 
correlation is linear. Multiplying both numerator and 
o ^ 

denominator of — by N, the number of observations, which 

leaves it unchanged, we can express the relation (6.3c) in words 
thus : 


1 _ 2 _ Sum of Squares of Deviations from Arraj! Means of Y 
^ Sum of Squared Deviations from General Mean of Y 


or 
1 - 


n /•'! 


Sura of Squares of Deviations from Array Means of X 

Sum of Squared Deviations from General Mean of X 

For formula (6.4c) we have : 

^ 2 „ Sum of Squares of Deviations from Linear Regression Line 
Total Sum of Squared Deviations from either x ary 


( 6 . 54 ) 


( 6 . 5 «) 


( 6 . 8 ) 


Considering now that the sum of squared deviations from 
the mean is a minimum, that is, smaller than from any other 
value,* it follows that 1 — r* cannot be less than 1 — 
Furthermore, that the more the line connecting the array 
means deviates from the straight line (regression line), the 

♦ see Appendix II 
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more pronounced will become the difference between 1 — 
and 1 — p®, the former getting greater and greater. 

Consequently r'^ must become increasingly smaller than 
with increased curvilinearity of correlation. This is usedras a 
test for linearity, or otherwise, of correlation. 

5.12 The Standard Etror of Estimate— Control by Correlation 

is called the Standard Error of Estimate, 
and has ‘a function similar to that of the Standard Error of an 
Average. (The difference between the estimated Y and its 
true value is, in fact, the error of an average, because in the 
equation for the regression of Y on X, Y stands for the average 
of Y’s belonging to a given X.) It is a measure of the confidence 
that can be placed in the prediction that a particular X will 
produce the average value of Y given by the ordinate of the 
regression line corresponding to that particular X. 

The Y’s of a particular array belonging to a given X will 
differ more or less from their “ true ” value — ^taking the 
estimated Y on the regression line to be that “ true ” value — 
according to the intensity of correlation. The standard error 
of estimate itow indicates the range on both sides of a particular 
Y within which to expect with a certain probability the “ true ” 
value to lie, and, conversely, the range on both sides of the 
regression line within which for a given X to expect with a 
certain probability the observed Y’s to lie. 

This suggests — assuming a normal distribution of the 
residuals — the use of control lines by taking three times cf^ on 
either side of the regression line as the limits within which all 
Y’s can be expected to lie, and two times as including 
96 per cent, of the plotted points (Fig. 6.10). 

For the purpose of control, the standard error of estimate 
is determined for the pair of variables as it results from the 
best possible performance of both, machine and operator, 
using standard material. From that standard error, control 
limits are obtained in the usual way. Should subsequent 
inspection, carried out at specified intervals, yield points falling 
outside the control limits at two respectively three times the 
Standard error in greater than the assigned proportions, this 
is taken as in indication that the intensity of dependence 




Fig. 5.10 

Directed Control Chart for Weight using Sub-groups of Height 
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between the variables was relaxed, and steps would be taken 
with a view to ascertaining and removing the cause of such a 
disturbance. 

Such limits are conU'ol limits in a somewhat different sewse 
of the term from tltat which we have encountered hitherto. 

First, on the practical side : 

What we are concerned with here is not so much the 
question of whether one particular column shows a Y greater 
than it ought to (although even that question can become 
important), but to what extent Y in geneyal is controlled by X. 
This information is provided by the distance between our 
control lines measured along the Y axis. 

The direction of the regression line itself can provide such 
information only if the variables arc standardized ; tlien, its 
inclination between 0 and i5° will indicate the amount of 
correlation varying between 0 and 1. But the control lines 
at ± three times u,- will do that also and even without standard- 
ization. 

All we have to do is to read the distance between these 
lines on the Y scale and it will give us a very good idea of 
what the state of control is like, getting all information about 
this matter in a single distance. Such visual or spatial idea 
of the extent of association will sometimes have advantages 
over the 'Usual method of compressing the information into 
one single parameter, because it shows the amount of overlap 
between the partial distributions of the dependent variable at 
a glance, upon which, as we know, depends the intensity of 
correlation. 

Thus, direction of regression lines and distance of control 
limits supplement one another here very effectively. 

Secondly, on the theoretical side ; 

The control limits on the chart for averages are three times the 
standard error of the averages, tliose on the chart for the standard 
deviation or range are three times the standard error of the 
standard deviation or ranges respectively. But the control limits 
on the correlation diagram must not be taken to represent a 
multiple of the standard error of the correlation coefficient, Apart 
from our scanty theoretical knowledge of that type of error for 
non-normal universes, this is not what we are concerned with. 
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If we were interested in the variations of the dii action of 
the regression line representing different samples from one and 
the same universe, then we would have to take the standard 
erioj; of the correlation coefficient into account.* 

Although this question of the significance of the correlation 
coefficient obtained from a sample may become very important, 
especially if the sample was small, we can fortunately evade 
that complication by working with large samples of, say, 100 
to 300. For our purpose it is more important to obtain a really 
good working idea of what a certain amount of R means, and 
this we can get by the method described above. The distance 
between the ± Su,. lines, read off the Y scale, shows at a 
glance the variability of the Y’s belonging to a given X. 
Further, we see immediately the amount of overlap between 
neighbouring columns, and can thus get a very good idea to 
what extent a change in X controls a change in Y. We also 
see the steps by which Y increases per unit increase in X. 

5.13 The Compass of Chance 

Let us still further realize how our idea of control lines 
has changed if we speak of the lines Ai, Aj in Fig. 5.10 as 
control lines. 

They are what might be called “ directed control lines.” 
By this is meant that it is not only their position which matters 
now but also their direction. We can, under certain conditions, 
read the amount of correlation between the variables, repre- 
sented on our two scales, from their inclination toward the 
horizontal. Generally, their direction is no longer one and 
the same on all charts, as is the case with om- former control 
limits, but depends on the amount of correlation between the 
variables. That is why the simile of the Compass has been 
chosen for the heading of this chapter. 

Let us suppose that a ship to whose captain definite sailing 
orders have been given at the commencement of the journey 
had arrived at her destination, and we were asked to find from 
the log book whether the ship’s course had always been in 
conformity with those orders. The course according to the 

* For a brief and dear account of the sampling error of r, see Levy and 
Preidel, Elementary Statistics, Nelson’s Aeroscience Manuals, ISil 
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sailing directions we call the “ intended course.” It can be 
thought of as a specified direction. Studying now the log book 
day by day, we can ascertain to what extent that direction was 
adhered to. We shall find, no doubt, that, on the whole,, the 
ship was steered in compliance with her intended course, but 
wind and weather might not always have permitted her to 
do so completely. There is also the possibility of deviation 
from the intended course because of inefficiency on the part 
of the officer or men in charge, and because of inherent short- 
comings in the construction of the ship and her machinery. 

The degree to which the sailing direction was actually 
adhered to could be represented graphically by plotting the 
actual position of the ship, day by day, or watch by watch. 
We should then find tJiat these points are arrayed around the 
line (straight line or smooth curve) representing the intended 
course. The sum of the squared daily deviations from that line 
divided by the total number of observations (giving the square 
of the standard deviation of the actual positions from the 
intended course) will give a good idea of the extent to which 
the officer in command had been able to make the ship carry 
out her sailing orders. The better he was able to do this, the 
smaller would be the standard deviation. If we drew “ control 
limits ” at two or three limes that standard deviation, then the 
distance between such limits would be a good indication of the 
extent to wliich the ship’s course has been “ under control.” 

In the same way we can think of our control limits around 
the regi’ession line as indicating, by the distance between 
them, to what extent one variable (the dependent one) was 
controlled by the other, a small distance indicating a high 
degree, a wide a low degree, of control. 

But as omne simile claudicat, our analogy should not be 
pressed too far. In Figs. 6.6, 6,10 we cannot regard the 
direction of the regression line, and therefore of the control 
limits, as a reliable indication of anything ; only the distance 
between the control limits would be indicative of the amount 
of correlation. In Fig. 6.6, on the other hand, we have 
in the direction of the regression line already a measure of the 
amount of association between our variables, and the distance 
between the control limits is, therefore, here dependent on 
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their direction. As the angle of inclination increases from 0° 
to 46°, that distance becomes smaller and smaller. The “ in- 
tended course” of a ship, however, gives by itself the direction, 
and^the deviations of the actual cotmse from that theoretical 
course do not .stand in a functional relationship to its direcUon. 

But in spite of such shortcomings in our analogy, I believe 
that it will serve to bring out the essential difference between 
control charts with horizontal and inclined control limits, and 
so contribute towards making a difficult distinction clearer. 

5.14 Summary 

A. The disadvantage of the simple graph method for show- 
ing correlations is that the inclination of the regression line is 
no measure of correlation. 

The disadvantage of the Gallon method is the complexity 
of procedure. Now the use of control limits in the sense 
described above makes it possible to retain the simple method, 
and yet obtain a very good idea of tlte intensity of coiTclatiqn. 

It is, of course, possible to provide the Galton diagram with 
control limits, and so supplement the summary information 
obtained from that diagram as to the amount of correlation 
by the more detailed one of the standard error of estimate. 

B. The study of simple correlation has thus led to an im- 
portant generalization of the control line idea.* 

The horizontal control hnes appeal’ to be a special kind 
of control lines, only applicable if there was no correlation 
between the characteristic in question and the characteristic 
used for sub-grouping {control by chance causes) . 

If, however, ^uch correlation exists, and randomness is 
disturbed, we can still use control lines which are so designed 
as to provide the information we are after in such cases. What 
we want to know in such cases is to what extent our primarily 
important characteristic is eflectively controlled by another 
characteristic which is directly accessible to measurement 
{control by assignable causes ) . 

In the example given we have, for instance, found that 
approximately 44 per cent, of the variance of weight was con- 

* Fdr a case of elliptical control lines, see G. Herdaii, " The Analysis of 
Colour,” British Science Mews, 1947, vol. i, 4 
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trolled by height of filling { 1 ^ = = -4366), and if we are 

satisfied that for tjie performance in question such control is 
sufficient, that measure of dependence should form part of the 
specifications for filling by height. r 

Considerations of that kind must, whether they are explicitly 
aware of it or not, form the basis for the management’s decision 
to use a machine by which a certain characteristic can only be 
indirectly controlled. 

5.15 Illustration for Curvilinear Regression 

(a) For the proper functioning of cartridge cases it is re- 
quired that, at specified intervals along the case, the hardness 
of the metal be of a certain degree. This can also be expressed 
by saying that the gradient of hardness at specified points along 
the case should not exceed certain limits. 

We thus arrive, as a fundamental requirement for good 
functioning of tlie case, at a correlation between distance 
measured along the case and hardness of metal. The regression 
curve is a rather complicated one, as shown in the following 
diagram : 
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(b) The assignable causes foa- hai'dness are mainly these : 

(i) The quality of the metal strip from which tire discs 
are olrtained, which are transfuimed into the cases ; 

(ii) the operation of the machine which draws the 
metal discs out into the case ; and 

(hi) the semi-annealing and tip annealing operations. 

It is evident that these main assignable causes must be 
controlled, in the engineering sense of the term, in order that 
the cases should have the required gradient of haj’dncss. A 
continuous check on production, with a view to finding out 
whether or not the product complies with the requirements 
regarding hardness, can be achieved by means of a quality 
control chmt of a peculiar kind. In this type of chart use is 
made of the correlation between the distance along the case 
and the hardness of the metal. It is evident that the distance 
of the case does not “ control ” the hardness in the engineering 
sense of the term. Such control cah only be achieved by con- 
trolling the assignable causes. But we can make use of the 
correlation between distance along the case and hardness for 
satisfying ourselves during mass production that these assignable 
causes are working in just the way in which they should work. 
That is, that they are blended so as to produce cases according 
to specifications. 

(c) In order to construct a quality control chart of that kind 
we must first obtain the correlation cuiwe bet-ween distance and 
hardness. As the curve is rather a complicated one, we obtain 
it graphically, ’VVe select a number of cases, say 100, of which 
we know that the hardness more or less satisfies specifications, 
take the mean hardness at each of the specified points, connect 
those means by a continuous emwe, and so obtain the regression 
curve graphically. 

"What we arc now interested in is to know the control limits 
which will enable us to separate chance deviations from devia- 
tions brought about by significant changes in the assignable 
cau.ses. For this purpose we must know the standard error of 
estimate for our regression curve. To obtain tliis we proceed 
in the following way. 

A point on the curve at a specified distance along the case 
length is obtained as an average of a distribution of points 
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(array of points) , We now form the sum of squares of deviations 
of the points (constituting such an array) from the array mean, 
sum for all arrays, divide by the sum of squares of deviations 
of all hardness measurements from the general mean, and 
subtract the result from unity. This gives us the square of the 
correlation ratio, tj®. The standard error of estimate can now 
be obtained as the product of the standard deviation of the 
total of hardness measurements by Vl — (equation 5.3c). 

The inner control limits will be at a distance of twice and 
the outer control limits at a distance of three times the standard 
error of estimate on either side of the regression curve. As seen 
from the following diagram (Fig. 5.12), this gives us a band 
within which hardness measurements at specified intervals are 
allowed to vary without justifying the suspicion that an un- 
desirable change in one of the assignable causes has occurred. 
A hardness measurement outside that band, however, would 
indicate such a change, and, consequently, point to the neces- 
sity of investigating the matter. 

{d) In order to use the above metjrod for continuous 
checking of mass production some modification must be made 
in the charting. First of all, we arc only interested in the case- 
hardness at specified points. We therefore need not have a 
continuous curve. Secondly, we want a chart drawn up in 
such a way as to enable us to plot at specified intervals a great 
number of examination results — that is, of successive samples 
of cases. 

As can be seen from the following diagram, the fact that we 
are only interested in the case-hardness at specified discrete 
points along the case length enables us to meet the second 
requirement. 

Instead of putting a continuous curve on the chart, we put 
disconnected horizontal lines at the regression level of hardness 
for the specified points along the case length, and parallel to 
these also disconnected horizontal lines at twice and three times 
the standard error of estimate. We then plot the inspection 
results for each specified distance from tire lower end of the 
case, one beside the other, a case being represented by a dot 
at each hardness level. According to the length of the horizontal 
lines we have chosen—and that again depends on the dimensions 
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of our chart — ^wc then can use the chart for recording the results 
of a great number of successive samples. 

5.16 Multiple Correlation 

r 

So far we have been concerned in this chapter with the 
mutual dependence of two characteristics. Although it is not 
the purpose of this first introduction to correlation methods 
to discu.ss in detail the intricacies of partial and multiple 
correlations which arise in the investigation of an eifect con- 
ceived as depending upon the combined action of a number 
of “ indepencleni ” variable causes, a brief outline of how 
to proceed in such cases will be given in this paragraph, since 
in industrial research work we are very often confronted with 
problems of this type. 

The multiple correlation analysis can be performed by 
means of formulae for partial correlation coefficients — for the 
net effect of one independent cause upon the dependent 
variable — a procedure which can be recommended only if 
such formulae are fully understood. It is, however, possible, 
just as in the case of multi-variate analysis of variance, to 
simplify the analysis, sacrificing elegance of mathematical 
procedure to real understanding, by taking out the effect of 
one variable at a time. This procedure is here called the 
technique of “ sticcessive elimination , of which this paragraph 
gives a brief outline.* 

The experimental data on which I propose to demonstrate 
the method were obtained from an experiment on corrosion 
of steel, whose object was to ascertain the dependence of the 
corrosion effect upon (1) time of exposure, and (2) thickness of 
a protective film. Thirty steel panels were exposed to weather- 
ing conditions, and the loss of weight due to corrosion was 
recorded for each panel whose film thickness was also ascer- 
tained at certain time intervals. In Fig. 6.13, the loss of 
weight due to corrosion is plotted against time of exposure for 
, these tliirty panels. Assuming a linear regression between the 
two quantities—which is, however, only approximately true — ’ 
we obtain the correlation coefficient by any of the methods 

* For a full account of this very important technique the reader is refen-ed 
to M. Ezekiel, Methads of CorretcHkin Andysb, J. Wiley & Sons, New York, 1941 
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explained in this chapter, which enables us to construct the 
regression line according to paragraph 5.0. 

There is, however, no need for calculating the coefficient 
of Cjprrclation if what we are interested in is primarily the 
form, and not so much the amount, of dependence between our 
variables. In such a case, we can calculate the two constants 
of the linear regression line directly. Calling the loss of weight 
due to corrosion Y, and the time of exposure X, we have for 
the slope of the regression line, 

^ ‘ ’ 


In order to understand how this formula is connected with 
those used previously in para. 5.G for the correlation coefficient, 

r, and for the coefficient of regression (— ), let us write for the 
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Using instead of x andji which are the deviations from the 
respective means, the original measurements X and Y, equations 
(5.8a) become 
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For the intercept on the Y-axis we have 

a — M,, — byt. Mx (5.9) 

which relation is obtained simply by substituting in the general 
form of the linear regression equation y — a bx for x and y 
the corresponding means and for the constant b the slope by^.. 

Calculated in this way, the regression line showing the 
dependence of corrosion upon time results as 

Y' = 0-068 + 0’025X (5,10) 

* S standing for “ the sum of nil . . .” 
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Fig. 5.13 


This means that one day exposure to the weathering tests 
produces, on the whole, a weight loss due to corrosion of 0-025 g. 

But this is dearly not Ae whole story. We have here 
implicitly assumed— apart fiom the linearity of the relation — 
that time was the only factor responsible for the severity of the 
corrosion effect. There is, however, as we know, another 
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factor which might conceivably influence the corrosion effect, 
viz. the thickness of the protective film. We must, tlierefore, 
correct the picture for the film effect. 

5’he first thing to do is to take the time effect out of the 
total corro.sion cfiect, or, as we say, to “ adjust ” the weight 
loss for the time effect. Such “ adjusting ” is done either by 
calculating the weight loss with time according to equation 
(6.10), and subtracting the quantities so obtained from the 
observed weight loss of the panel in question, or, graphically, 
by measuring along the Y-axis, the distance of the plotted 
points from the regres.sion line (Fig. 5.13). In this way, the 
weight loss due to time is taken out of the total weight loss, 
and these numerical differences or distances representing the 
variation in weight loss due to variation in film thickness (or, 
may be due to other factors as well) are now plotted against 
the individual film thickness of the thirty panels (Fig. 6.14). 

Comparing the resulting scatter with that of the original 
observations, we notice at a glance how much in the latter 
was due to the time factor. We now calculate the regression 
line exactly as before, and obtain the equation 

Y" = 0-164 - 0-676 Z (6.11) 

Since Y" represents the weight loss adjusted for the time 
effect, equation 5,11 must necessarily give the net effect of 
film thickness on corrosion, viz. that a unit increase in film 
thickness means a reduction of weight loss by 0-575 g. 

We cannot combine both regression equations, since equation 
6.10 does not yet represent the net effect of time on corrosion. 

We now repeat for the time factor what we have done for 
the film thickness factor. That is, we adjust the total weight 
loss for the net effect of film tliickness, either numerically by 
subtracting each Y" obtained by equation 5.11 from the corre- 
sponding observed Y, or graphically from Fig. 5.14, and plot 
the residuals against time. This means that we have now 
plotted the net effect of time on corrosion against time. The 
regression equation results as 

Y" = 0-047 + 0-026 X 
which differs only slightly from equation 6.10. 

Gombining now the equations for the net effects, or for the 
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partial regression, of our two independent variables we obtain 
the multiple regression equation 

Y" == 0'2n + 0-026 X - 0-575 Z (5.12) 

which tells us how the corrosion effect is influenced by time 
of exposure and film thickness, viz. that one day exposure 
wiU, on the whole, result in a weight loss of 0-026 g,, and 
that one unit increases in film thickness will, on tire whole, 
mean a decrease in weight loss of 0-676 g. 



Chapter VI 


“DEFECTIVES” AS ACCIDENTS— Q.UALITATIVE 
CONTROL CHARTS 

6.1 Distribiitioji of Repeated Events— The Statistical Theory of 
Accidents 

We arc by now used to the idea of chance deviations of measure- 
ments obeying a certain kind of law, which we call the Normal 
Law. But it might be very surprising to hear that the type 
of chance events which we call accidents also obeys a law. Let 
us realize what is meant by this. 

It was in the second half of the nineteenth century that a 
Russian statistician, Bortkiewicz, hit upon the following idea. 
In order to find 'out whether there was any regularity in a great 
number of evidently unconnected accidental occurrences of 
one and the same D'pe, he investigated a seemingly eiTatic 
event — death by kick of a horse in the Prussian Army. He 
collected information about ten Prussian Army Corps over a 
period of twenty years, and then grouped the observations 
according to whether 0, 1, 2, 3, and so forth, accidents of this 
kind had occurred per year per Army Corps. This yielded 
the following frequency distribution (second column) ; * 


Deaths 

Frequency 

Per year prr 



army corps 

Observed 

Expected 

0 

109 

108-67 

1 

66 

66-20 

2 

22 

20-22 

;{ 

3 

4-11 

4 

1 

-63 

0 

— 

-08 

6 


•01 


If we were to plot this frequency distribution qn graph 
paper we would get a very different shape from the normal 

* L. Bortkiewicz, Lmu of Small Leipzig 1897 
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curve. It would give a J-shapcd distribution, or rather a 
reversed J-shape if we took the origin, as is customary, on 
the left side. The graph would be highest at zero, and would 
approach the horizontal axis more and more the greater- the 
frequency of accidents in the chosen interval. 

It then occui'red to Bortkiewicz that a certain mathematical 
formula, which was arrived at many years ago by a French 
mathematician, Poisson, for the distribution of statistically 
rare events, might be suitable for expressing the law governing 
this kind of distribution. Substituting for the algebraic symbols 
in that formula certain characteristic quantities of the case 
under investigation, he calculated the frequencies in the 
various class intervals, according to that formula, and obtained 
the figures given in the third column of the above table. The 
conformity between observed and theoretical figures is so 
striking that it convinced scientists of the suitability of the 
formula for describing accident distributions. 

Experience has shown that they were quite correct, and 
although the simple formula first used underwent several 
modifications according to the increased complexity of the 
data to which it was to be applied, we are still using to-day 
fundamentally the same idea and formula, and the whole 
complex of the mathematical forms employed is called the 
Statistical Theory of Accidents.* 

We have to-day arrived at tlie theory that although an 
accident of a certain type, taken by itself, is something quite 
unpredictable because of the variety of causes involved, a great 
number of accidents can be transformed into a series which 
shows the working of the Law of Chance. It follows that, 
although we cannot predict whether or not a certain individual 
will meet with such an accident, we can predict how many 
space-time intervals, like Army Corps per year, will be affected 
— that is, affected with 0, 1, 2, etc., accidents of that kind over 
a great period of time, or, if we were dealing with minor 
accidents in a given occupation which could occur several times 
in one and the same person, how many persons there would be 
with no accident, one accident, two accidents, three accidents, 
etc., occurring to them over a given period of time. 

* It owes its modem fotiii to G. U. Yule and M. Greenwood 
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6.2 The Application to Industrial Control Inspection — ^Defectives as 
Accidents 

How is that idea of regularity in the occurrence of accidents 
conj,tected with indu.strial control ? 

It comes into play whenever we are dealing with what is 
called “ Go — No Go ” gauging of a product. In this case we 
are not concerned with taking the exact measurements, but 
we put the article which we are examining for a certain char- 
acteristic through a gauge which is constructed according to 
the specifications laid down for that characteristic. The gauge 
has two parts. If the article fits into the “ Go ” part of the 
gauge it means that that characteristic does not transgress the 
upper specification. If it does not fit into the “ No Go ” 
part of the gauge — the “ forbidden ” part — ^it means that the 
characteristic is not below the lower tolerance. If these two 
conditions materialize, the article is satisfactory. If, however, 
it does not fit into the “ Go ” part, or does fit into the “ No Go ” 
part, then the dimensions of the characteristic are larger than 
admissible or smaller than admissible. 

It is evident that one could construct and use “ Go — 
No Go ” gauges for characteristics which can be measured in 
the strict sense of the term. The gauges are then constructed 
according to the greatest and smallest measurements which 
are to be tolerated. 

There are, however, cases of control inspection where 
the use of a gauge is the only possible way for controlling a 
given characteristic, and it is tliese cases to which the accident 
theory specially applies. If, for instance, wc are dealing with 
the form of a somewhat complicated body, or with functioning 
defects, we cannot apply proper measuring instruments, The 
defect is here the result of a complex of causes, and materializes 
only if all these causes are present. If one or the other is 
missing the effect simply does not arise. Thefe are thus no 
degrees of defectiveness in this case. The defect either exists 
or does not exist, and a third possibility is not given. 

Let us, for instance, think of the form of a round of am- 
munition. Specificatioas are laid down for that form, so as to 
make it fit into the barrel of the gun from which it is fired. 
The only way of ascertaining whether the product complies 
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with these specifications is to have a gauge available which 
imitates the dimensions of the barrel, and then try whether the 
round fits into that chamber, and if so, if it does not fall too 
short of the dimensions of that gauge. What this amount" to 
is to find out whether the round fits perfectly into the barrel. 

In such a case it is not any more a question of chance 
causing a well-defined characteristic to deviate from the 
specification value to a greater or lesser degree, but of the 
combined effect of causes which, if the process is controlled 
properly, tvill produce a round of the right quality. There 
may be, however, comparatively rare cases in which all these 
very different causes combine, by pure chance, in such a way 
as to bring about a misfit, and such a combination is then 
quite in the nature of an accident. It cither happens or docs 
not happen, the fiist part of the alternative being comparatively 
rare. This is why in such cases it is conceivable to use the 
theory of accidents shortly outlined above, and it has been 
found in industrial practice that the theory is suitable for that 
purpose. 

Although there are no degrees of accident as such, this 
alternative of presence or absence of a defect does not prevent 
us from drawing up a frequency distribution of samples 
according to the number of times a defect has occurred. For 
a product having a low average defectiveness, we wiU, as a 
rule, get a great number of samples (each consisting of a 
number of units) with no defects, and decreasingly smaller 
numbers of samples with 1, 2, 3, etc., defects. 

6.3 Constructing the Quality Control Chart — The Meaning of the 
Control Limits 

Having found that the number of defects in samples of a 
given size obeys a similar law as do industrial measurements — 
in fact, the law referred to as the Poisson Law, is very closely 
related to the Normal Law — ^it is but a small step to the idea 
of controlling the quality of a non-measurable characteristic 
by means of control charts of a type similar' to that which we 
already know. These charts are sometimes referred to as 
Qualitative or Number Defective (Fraction Defective) charts to 
distinguish them from Qpantitative or Measurement ones. 
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The usual practice in inspecting a product for a uon- 
measurablc characteristic is to take a fixed number of units, 
say 20 to 60, putting each through the “ Go— No Go ” gauge, 
and^ counting the number of cither “Go” or “No Go” 
defects. If the characteristic was length of an. article we 
would then get two distributions, one for the “ Go ” part and 
one for the “ No Go ” part of the gauge. If it was form of 
an article, we would, as a rule, get only one distribution, 
because in form gauging one is usually satisfied with the article 
fitting into the gauge, and the question of whether it fits too 
well into it does not always arise. 

The sample quality is then recorded on the charts— one 
for each gauging characteristic — ^in the usual way. The left- 
hand side of the chart bears a scale starting at zero and pro- 
gressing in steps of one, symbolizing the number of defects 
per sample of n units. The sample quality is tlien plotted by 
putting an asterisk (*) for the sample against the number of 
defects occurring in it. 

The average defectiveness is easily found by adding all the 
defectives in, say, 50 samples, and dividing by the number of 
samples — that is, 60 in our case. It is this mean defectiveness 
of the product, as found fi'om the samples, which allows us 
in a very simple way — according to the Poisson formula re- 
ferred to above — to arrive at the 1 in 40 and 1 in 1000 limits. The 
meaning of tltese limits is fundamentally the same as described 
in Chapter II. They represent two (1'96 to be precise) times 
and three (3'09 to be precise) times, respectively, the standard 
deviation measured from the mean. But the standard deviation 
of a perfect Poisson distribution stands in a very simple relation 
to the mean : it is simply its square root. All we have to do, 
therefore, in order to find our limits, is to multiply the square 
root of the mean by 2 and 3, measure these distances from the 
•mean in the positive direction, and draw horizontal lines at 
the nearest point of the left-hand scale for numbefs of defects 
per sample * (Fig- d-lj P- 145). 

One difference from the ordinary chart is that the qualita- 

* As in this type of distribution the mean and the standard deviation are 
no longer independent of one another, it is dear that only om chart will be needed 
for controlling both level and spread. 
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live chart is a one-sided chart. It would be practically useless, 
although theoretically possible, to put the lower limits on the 
chart, especially if they are beyond zero. We therefore plot 
only one pair of the inner and outer control lines on the clp,art, 
namely, those which we obtain by adding two or three times 
the standard deviation to the mean. 

As we arc dealing with a one-sided distribution, the 
quantitative meaning of our control limits is somewhat modified. 
The 1 in 40 limit means in this case that there should be only 
1 in 40 found beyond this limit in the whole distribution, 
whereas on the quantitative chart we expect 1 in 40 on each 
side of the mean to lie beyond that limit. The 1 in 40 limit, 
therefore, on the qualitative chart cuts off a percentage of the 
graph amounting, not to -05, but to -025. Similarly, the 
1 in 1000 limit cuts off on the qualitative chart an area amount- 
ing to '001, whereas on the quantitative chart the area of the 
graph lying beyond both 1 in 1000 limits is -002 per cent. 

The meaning of these limits is that if the defects in question 
occurred like accidents, on pure chance, with no disturbing 
factor intervening, we should expect only 1 sample in 40 
approximately to lie beyond the inner and practically nothing 
to lie beyond the outer limit. To what extent this is met with 
in practice we shall see in paragraphs 6.6 to 6.7. But first 
a simplification of the method for constructing the 1 in 40 and 
1 in 1000 lines will be briefly outlined, 

6.4 Labour-saving Method for Obtaining the Control Limits 

Although the arithmetical effort for constructing those 
lines is by no means great, it can be reduced still more by 
using what is known as the Poisson probability limit and charts 
constructed from it. In the chart. Appendix V, the Poisson 
probability limit is given in the form of a chart, from which 
for a given mean number of defectives we read the probability 
that a specified number of defectives or more will be found 
in the sample (chart, Appendix Va). Sometimes the prob- 
ability scale on the left-hand side of the chart is turned 
upside-down. If so, the chart tells us that a specified number 
of defectives , or less will be found in the sample (chart, 
Appendix Vb). 
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From charts like these the diagram, Appendix VI, 
is constructed, which allows us at a glance, for a given mean 
defectiveness, (M), to read the 1 in 40 limit and the 1 in 1000 
limit, by obtaining the crossing-point of the vertical erected 
at the observed mean defectiveness per sample with the 
1 in 1000 and 1 in 40 curve, and projecting it to the left. 
There we read the number of defectives at which to draw 
the 1/40 and 1/1000 limits on our chart (sufficiently correct 
for defectiveness of less than 10 per cent.). 


6.5 Computation of Limits if Defectiveness is Greater than lo per cent. 

This method for computing the position of the control 
lines — by means of the Poisson Law — should be used only if 
the defectiveness in the population is less than 10 per cent. 

For greater defectiveness, the standard deviation is cal- 
culated according to the general formula for the Binomial * 
distribution 


or 


cr = •\/npq for Number defectives . . . . (6.1) 
c — ^1— for Fraction defectives .... (6.2) 


where q is the defectiveness, expressed as a fraction of 1, 
^ = 1 — and n the number of items per sample. 

As we do not, as a rule, know q exactly, we substitute for 

q the average defectiveness per individual sample, — {m stand- 

n 

n — m 


ing for the number of defectives), for p we substitute 


n 


and write for the standard deviation of samples of n with 
regard to the number of defectives 


/ min — m) 


(6.3) 


and with regard to the fraction defectiveness 


a 



m) 


. . (6.4) 


* For an explanation of this term see Chapter VII, paragraph 7.i 
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Two times and three times that quantity added to the average 
defectiveness in samples of then gives the 1/40 and 1/1000 
limits, respectively. 

(5.6 Tie Distinction Between Statistical and Economic Control 

The work with these control lines on qualitative charts is 
very similar to that with the control lines on quantitative 
charts, but there are important differences which must be 
observed if the product is to be effectively controlled, not only 
statistically but also economically. The differences arise from 
the fact that this time, on the qualitative chart for number 
defectiveness, we are plotting not averages, but single sample 
qualities. Consequently, the greater the sample size the more 
(lefectives will, as a rule, be gathered in a sample, and as the 
control limits are calculated from the sample quality, the 
wider will be the control limits. This is very different from 
the effect of increased sample .size on our quantitative charts, 
where the control limits become naiTOwer the greater the sample 
size, because the standard error of the mean, which forms the 
basis for arriving at these control limits, decreases in the ratio 
liVn. This inconvenience can, however, be avoided by work- 
ing with the fraction instead of the number defectiveness. 

Furthermore, it is evident — and this the qualitative chart 
has in common with the quantitative range chart — that the 
worse the quality gets, the wider will be the control lines. 

The consequence is that cjualitative, like quantitative, con- 
trol limits must never be used without taking into account what 
the economically desirable quality of the product should be. 
If it is desirable that the product should not be worse than 
say, 1 per cent, defective, then one must not be content with 
working to control limits which correspond to an, average 
pefcctivencss of significantly naore than 1 per cent., assuming 
that if everythipg comes out within the control limits it will 
be all right. This would be a very wrong assumption. Let 
us make that point clear by referring to a matter which is 
familiar to everybody (Fig. 6.2). 

An Insurance Company will calculate premiums according 
to the risk to be insured. The premium will he greater the 
greater the risk. To fix our ideas let us think of the number of 
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accidents for a given occupation or for a given period of life. 
The greater the danger of accidents in a given occupation, and 
the more exposed to dangers a certain period of life is, the higher 
will jjie the premium charged by the insurance Company. 

On the other hand, it is desirable to reduce the premium as 
much as possible, in order to make as many people as possible 
take out insurance policies without too great economic dis- 
advantage for them. If the Insurance Company is to comply 
with this desideratum, it can do so only if the risk is reduced, 
that is, if the rate of accidents is lowered. 

What corresponds on our charts to the premium — that is, 
to the way in which the Insurance Company tries to protect 
itself against possibilities of accidents — are the control lines. 
The greater the defectiveness of the product, and therefore the 
risk of defectives occurring, the wider will be the control lines. 
Too many defectives, however, would be just as undesirable 
as are too high premiums. Only a few people would take out 
insurances, and only a very small part of the product would 
pass final inspection, and a great effort would be wasted, in 
spite of everything being under statistical control. 

The way out of the difficulty is to try to reduce the 
defectiveness in the product so as to make the manufacture of 
that product profitable. The indication that this ought to be 
done is given by comparing the actual position of the control 
lines with those which an economically fairly satisfactory 
product would yield. If the control lines are sensibly wider, 
then it is for the Production Engineer to do everything in his 
power to bring them to the level of the economically desirable 
control limits. 

6.7 Working to a Target 

Here again we must be on our guard not to confuse statistical 
limits with specifications. 

Let us assume that a Production Management wanted to 
work to a target, that is, was aiming at a product with no 
more defectives than a certain amount, and let us assume 
that amount to be, say, 4 per cent. If samples of 60 are taken 
every half-hour we should expect about 2 in 60 to be defective. 
This would correspond to specification. From the chart, 

C612) 1 1 
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Appendix VI, we read the inner and outer limits for this case 
as 6 and 8. But it is not these limits which should be put on the 
chart, once for all, although they may be put into the margin 
to remind us of the tai'get. Whether or not a certain batch 
of the product complies with this target can only be ascertained 
by calculating the 1 in 40 and 1 in 1000 from the actual samples 
taken. The position of these actual control lines is then com- 
pared with the control lines for 4 per cent, defectiveness in 
samples of 50, namely, 6 and 8. If they are wider, the machine 
must be altered or other steps taken in order to make the 
product comply with the target. 

But it would be a great mistake to put the target limits of 
6 and B defectives on the chart and then work to these limits — 
that is, adjust the machine only when a point occurred outside 
one of these limits. The reason is this. It is true that if the 
average defectiveness in samples of 60 was 2, which amounts 
to 4 per cent., there would be very few outside the 6 and practi- 
cally none outside the 8 limit. The converse, however, is not 
true, and this is where the method of fixed limits breaks down. 
If, namely, on a chart there are a few samples outside the inner 
limit and practically none beyond the outer limit of 8, it does 
not necessarily follow that the defectiveness of the lot is not 
more than 4 per cent. It is quite feasible that all the samples 
taken over a certain period wiU be round about, say, the 
5 defectives line. According to the idea of working to the fixed 
limits, there would be no need for correcting the machine, 
because all the samples would be within the limits. But, as 
most of them would contain 5 defectives, the product would 
be approximately 10 per cent, defective. This case is illustrated 
in the right half of Fig. 6.3. 

If the product is of great variability, then the use of the 
method of fixed limits is still more to be deprecated, because 
what it amounts to is that machines must be adjusted on the 
findings of single samples. A single sample, however, cannot 
be regarded as representative of the lot quality if the variability 
of machines or components is high. The proper statistical 
n^|thod, on the other hand, is better suited for such cases of 
great variability, because it does not work with single samples 
but by the combined results of, say, 20 to 60 samples. These 
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samples are used for arriving at the average defectiveness, and 
on the basis of that, at the control limits, and these limits, 
therefore, give a proper indication of the lot quality. That is 
why they can be used for adjusting the machines or mt^ting 
other engineering modifications to the process in question. 

It cannot, however, be denied fitat the qualitative control 
technique, as described, suffers from certain shortcomings 
when compared with the quantitative technique. In the follow- 
ing paragraphs 6.8 to 6.11 two important points of difference 
between the qualitative and the quantitative methods will be 
discussed, and such modifications for the former suggested 
as will tend to bring it closer into line with the quantitative 
technique, and as will go far towards making it into a more 
reliable instrument of control. 

6.8 Qualitative Charts for Averages 

The quantitative or measurement chart technique is essentially 
a kind of analysis of variance. We derive the control limits from 
the variation within samples of n items, and judge the stability 
of the process, or the homogenity of the product, according 
to whether the number of sample averages within and without 
the control limits is in conformity with the proportions specified 
by those limits (“ within ” to “ without ” as 38 to 2 and 998 to 
2, respectively). 

The control limits of the qualitative or number defective 
charts, on the other hand, signify points in the binomial or 
Poisson expansion for samples (not sample averages !) of n' 
items.* That is, they signify values of the parent distribution 
of single samples of n', and not of the sampling distribudon of 
means. 

Such charts are suitable for controlling quality character- 
istics, provided the changes in the process are not too erradc. 

It lies, however, in the nature of accident distributions like 
those we are dealing with that the stability of the process is 
often poor from batch to batch, if not from sample to sample, 
In such cases, we do not derive much benefit from the usual 
type of number defective chart, because the frequently un- 

* We denote here the rtumber of individual items per sample by n', to dis- 
tinguish it Rom the number pf such samples, n 
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accountable ups and downs of the control limits, calculated 
from a number of samples, do 'not allow us to improve the 
process for any sizable period of time. A machine adjustment 
indi/jated by one or a few “ wilds ” might not have the desired 
effect, because these outliers are just “ vdlds ” and not indicative 
of the general quality level. 

The way out of the difficulty is to bring the qualitative 
control technique more closely into line with the quantitative 
technique by obtaining limits, not for single samples of n' , 
but for sample averages, and watch the distribution of such 
averages with regard lo whether it complies with the propor- 
tions for “ within ” and “ without ” implied by the position 
of the limits. 

In order to obtain such limits,' we are making use of two 
properties of the Poisson expansion. 

(1) The distribution of means of samples of n', picked at 
random from a Poisson population, follows the Poisson Law, 
just as the distribution of means in a normal population 
follows the Normal Law. But it is important to remember 
that the interval which in the parent Poisson distribution is 

unity changes in the sampling distribution of means to - for 

n 

samples of n.* 

* Notes for the mathematical reader : 

— A A*" 

(1) If the general term of the parent Poisson distribution is « " — , that of the 

r! 

sampling distribution of means i.s e ^ interval being i instead of unity. 


(2) If VW 

S«= ?? 

Var („) = 1 

4/r 4 

(see J. O. Irma, Journal Royal Statistical Society, vol. cxi, Part 11, 1943). 
For samples of n the interval becomes and we have 






Var (a) = 


Var (jf) 
4n.n 


4rt 



1 54 QUALITY CONTROL 

(2) The computation of the limits can be very much 
simplified by making use of the remarkable property of the 
Pc^sson distribution according to which the square root of the 
variate has a variance which is almost independent of the iFean, 
and is approximately constant, amounting to 0'25 times the 
interval. For the parent distribution, therefore, it is 0-25. 

Considering now that the distribution of means obeys the 
Poisson Law (1), it follows that if we transform the distribution 
of means into that of the square root of means, its variance will 
also be approximately constant and equal to 0*25 times the 

interval and its standard deviation 



1 


We would have obtained the same expression, — Chad we 

2^/h 

calculated the standard error of the mean from the standard 
deviation of the Vx distribution for individuals according to 

the formula -^=,. As o is 0-25 (see paragraph (2) above), 

Vn 

o = VO-26 == O' 5, and the standard error is given by 
Vn 

Within the distance, then, of Jh twice and ± three times 

that standard deviation, — L,- , there will lie approximately 

2 Vn 


38 out of 40 and 998 out of 1000 averages respectively, pro- 
vided the population is a simple Poisson distribution, that is, if 
the probability of a defective article remains the same through- 
out the process. A frequency of points beyond those hmits in 
greater than the assigned proportions would point to lack of 
stability, or to a changing probability of defectiveness. 

The use of the V^ distribution of means has been found 
very useful in cases which were intractable to the usual control . 
method, and the removal of what appears at first sight only 
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a lack of symmetry between the quantitative and the qualitative 
method of control has. thus proved to be a real remedy. 

6.9 Illustration of Usual Method When Applied to Mixed Lots 

As an illustration of what has been set forth in the pre- 
ceding paragraph, 6.8, let us take the following experiment. 

Four lots of a mass-produced article (machine-gun am- 
munition) were selected which were known to be 3 per cent., 

5 per cent., 7 per cent., and 10 per cent, defective, respectively, 
witli regard to the quality characteristic “ Form.” 

By means of tables of Random Numbers (see Chapter VIII, 
paragraph 8.6, for an explanation of such numbers) a scheme 
for obtaining a random sequence of samples from these lots 
was drawn up and samples of 20 picked at random from a 
lot, the lots being taken in the order of the random sampling 
plan (Table 6.1). 

We denote the 10 per cent., 7 per cent., 6 per cent., 3 per 
cent, defective lots by A, B, C, D , respectively. Then the 
order of occurrences of these letters in the random sampling 
scheme can be taken to represent a random sequence of 
samples (of 20) picked at stated intervals from a product which 
is likely to vary abruptly. For each occurrence of the letter 
indicating a certain lot, two samples were taken, implying that 
the change in quahty lasted for a certain short period of time. 

In a case like this where the quahty changes abruptly, 
that is, where it improves and deteriorates unaccountably in 
short bursts, there is little advantage in machine adjustments 
based on indications of a scanty changing quality level, because 
any such adjustment presupposes that one knows all about 
the cause of trouble and that the quality levels are more or 
less permanent, or at least that they can he maintained if 
satisfactory, and changed if not. 

The following chart now shows the picture after 120 
samples of 20 items each had been taken and limits calculated 
after every batch of twenty such samples (Fig. 6.4). 

We see that no point falls outside even the inner limits, but 
that these limits move up and down, which would suggest that 
certain periods of production were consistently better than others. 

We know, however, that this is not true for the product 
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Col. I Random Sampling Sequence (order m which samples are to be with- 
drawn ft om lots A, B, C, D) 

n 

Col. II Sample Quality (in 2 samples of 20 taken from a lot indicated by 

Sampling Scheme) 

The table shows how the lots are grouped in landom blocks of four and of eight 
in I il 1 II I II 
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as a whole, because Llie random sampling scheme assures a 
reasonably even distribution of sampling over all the four lots, 
and these lots have always been “ on tap ” while the sampling 
was going on. This information, then, is somewhat misleading, 
and machine adjustments on the basis of the chart would not 
be justified. On the other hand, the chart fails to indicate the 
lack of stability within those “ periods.” 

Now let us plot the number of defectives in samples of 20, 
if the samples come from one and the same lot in succession, 
but keeping to the order in which they actually occurred 
(Fig. 6.5). The physical interpretation is that the quality of 
the product was maintained at each of the four levels for a 
considerable time. (For order of samples, see Table 6.2.) 

The first batch (coming from 10 per cent, defective 
lot A) shows the highest limits. The second batch (from 10 
per cent, defective lot A and 7 per cent, defective lot B) shows 
an improvement. The limit lines for the tliird batch (mostly 
from lot B, some from C), however, have moved up, which 
suggests a deterioration of quality ■which is misleading. The 
fourth batch (from lot G, 5 per cent, defective) shows rightly a 
drop in the position of the limits. But the fifth (some from lot 
C, mostly from lot D, 3 per cent, defective) shows a rise of 
the limits, and thus again suggests a deterioration without a 
physical basis. The last batch (from lot D) has the lowest 
lines, in conformity with its quality level. 

What must be inferred from this is that a machine adjust- 
ment suggested by the ups and downs of the limits will not 
always have the desired effect. It might even throw the process 
out of gear. For processes with the type of variability described 
above, we must modify our control chart technique. 

6.10 Illustration of New Technique 

Let us now treat the same sampling data according to the 
new method outlined in paragraph 6.8. 

We first form samples of 4, as given by successive blocks 
of four values of our sampling scheme (see Table 6.1), and 
calculate their average defectiveness. We thus obtain the 
sample means of Table i0.3, column I, which are arranged 
into a frequency distribution in columns 11 and IV. 
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Non-Random Sequence of Samples of 20 (but within each lot in the order of 

oecurrence) 


The Table shows how the lots are grouped in non-random blocks of four and of 

eight 
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1 
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2 

1 

5 
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S 

2 

1 

3 

2 

2 

2 

2 

0 

2 

i 
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2 

0 
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1 

1 

0 
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2 

3 

) 
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1 
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1 

1 

2 

1 
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3 

1 

2 
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1 

0 

2 

o 


0 

J 

1 

1 


0 

1 
0 
0 

0 

1 

2 
1 

X 

0 

1 

0 

1 

2 

0 

0 

3 

1 

0 

2 

2 

1 

2 
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D (3 %) 
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0 

1 

1 


0 

1 

0 

1 


1 

0 

1 

2 

1 

0 

1 

0 

0 

1 

0 

] 

0 

1 

0 

2 

1 

0 

1 

1 

] 

1 

30 


1 

The number at the foot of each column X 

represents one sample number per lot — 

34 
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Table 0.3 


Col. I Sample mcans of Random Blocks of 4 in the order in which they 
arose (rj 

Cols. II, III, IV, V Frequency distribution of sample means 


I 

II 

III 

IV 

V 


■’/ 

V-r,- 

Ji 

fiS/~ 

1-76 

0 

0 

— , 



I-OO 

-76 

■25 

■00 

1 

■50 

1-25 

-75 

■50 

„ -70 

3 

2-13 

-50 

1-00 

-75 

■S7 

3 

2-01 

-25 

1-UO 

1-00 

1-00 

S 

800 

1-00 

-50 

1-26 

1-12 

5 

o-no 

2-00 

2-76 

1-50 

1-225 

2 

2-45 

-50 

1-23 

1-75 

1-32 

3 

3-H6 

1-01) 

1-50 

2-00 

1-414 

3 

4-24 

1-00 

1-70 

2-25 

1-50 

■ 


-75 

1-24 

2-30 

1-08 

__ 


11-00 

1-23 

2-75 

1-00 

1 

1-80 

1-00 

1-20 

3-00 

1-73 

1 

1-73 

1-75 

1-40 

1-60 

1- no 

2- 0(1 
2-00 

30 

M = 

VX 

From cols. II, TV we get 

and M_ = 

-V 

32-38 

32-88/30 = 1- 

38-25 

38-25/30 = 1 


Now since = 


_ 


(T _ = 

/Jm 


30 

^fiW'sY 


30 




1-275 - 1-21 = 0-005 


and O' = 0-265 

Expected values ; ir ’ = 0-0025, o- = 0-26 

Similarly, for sample means of non-random blocks of i : 

M- = 1-276, M„= 1-OS 




= 1-275 - 1-17 
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«y.r 

tr = -324 


101 



102 


Q^UALITY CONTROL 


Next, the series of values is formed (column III), and 
their average X is calculated in the usual way, resulting as 
I-IO. The standard deviation for the ■sjx distribution is 
then obtained according to the formula ^ 




'Lx 

(L^/'xV 

N 

\ N / 


as -v/O-OSS = 


First, compare how close this observed value of the variance 


is to the theoretical value of i X ^ or 

4 n 


0-25 X 0-25 = 0'0626. 


One might conceivably be surprised that this is the case, 
considering that our samples do not come from one but from 
different Poisson distributions, representing four different 
levels of probability for producing defectives. But let us 
remember that our method of sampling has to some extent 
restored the random element.* 

It is, however, not the observed standard deviation but 
its theoretical value of 0-25 which we use, once for all, for 
arriving at the control limits for averages. We thus obtain 
the 1/40 and I/IOOO limits as ITO d; 0-50 and ITO ±0-75 
respectively. These limits fulfil the same purpose as the limits 
on the quantitative chart. They denote the width of variation 
for averages of n samples of n' specimen that will result by 
pure chance if the population is a simple Poisson distribution. 
Any excess in the number of averages beyond these lines in- 
dicates a disturbance or, as we say, lack of control. 

The following figure shows the new chart for the distribu- 
tion of Vx, or the square root of means (|f four successive 
samples of 20. 

We find 3 ‘averages out of 30 falling outside the 1 /40 limits, 
which suggests lack of .stability. This puts on record the erratic 
behaviour in short intervals, true to facts. 

Now let us form the averages of samples of 4, if all four 
samples come from one aild the same lot, taking the samples 
again in the order in which we happened to obtain them, but 
having first sorted the samples according to the lot from which 
they came (see Table 6.2). 


* see Czuber, WahrseheidiskkeitsUhfe, pp. 133-36 
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Fig. 0.() 

Number Defective Chart for the Square Root of Averages of 4 Sample.s 
of 20 ; Random Sampling 


The Standard deviation of the VJ distribution is now 
somewhat higher (0-324), pointing clearly to the presence of 
disturbing factors ; the chart (Fig. 6.7), with limits derived 
from the theoretical standard error for a simple or undisturbed 
Poisson distribution, shows again 3 points falling outside the 
] /40 limits. 



Fig. 5.7 

Number Defeclive Chart for the Square Root of Averages of i Samples 
of 20 ; Non-random Sampling 


The impression we get from the charts for sample averages 
is to some extent tire reverse of that gained from the previous 
charts for single samples. 

There we got the impression of changes in the quality level 
combined with stability or homogeneity within each of these 
periods. Here we get the picture of a somewhat unstable pro- 
cess in the case of random sampling, or, in the non-random 
chart as changing with the lot quaUty. 

Tliis is no doubt a nearer approach to truth, that is, to the 
physical conditions, because, as we know, the samples do not 
come from a homogeneous population, But it is still not what 
we want, because even the new charts do not enable us to 
distinguish between accidental variations, which ought not to 
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entail machine adjustments, and consistently bad quality 
necessitating such adjustments. 

But this is not inherent in the new method. It is due to 
our Working with rather small samples. The greater^ the 
number of samples whose averages we observe, the more 
sensitive does the method become to the vital distinction 
between erratic and systematic changes of quality. 

Let us, therefore, collect the samples into groups of 8, 
first taking blocks of 8 from our random sequence, and then 
taking 8 samples from one and the same lot or from adjoining 
lots, in the order in which those samples have occurred (see 
Tables 6.1, 6.2). The following Table 6.4 and charts show the 
distribution of averages of 8, from the random sequence and 
the non-random sequence respectively. 



Fig. 6.8 

Number Defective Chart for the Square Root of Averages of 8 Samples 
of 20 ; Random Sampling 



Fig. 6.9 

Number Defective Chart for the Square Root of Averages of 8 Samples 
of 20 ; Non-random Sampling 


These charts now give a definitely better picture of the 
actual physical conditions prevaiUng during the time of pro- 
duction covered by our sampling. ? 



Table 6.4. 

Cul. I. Sample inean.s of Random Blocks of 8 in the ordei" in which 
they arose (r-,) 

Cols, II, III, IV, V, Frequency distribution of sample meaps. 


I 

1-376 


11 

•fy 

III 

V-T/ 

IV 

fi 

v_ 

1-000 


0 

0 

— 

— 

■025 


-025 

■790 

2 

l-iiSO 

•625 


-760 

-866 

— 



1-000 


-.375 

•935 

— 

. — . 

1-260 


1-000 

1-000 

2 

2-000 

1-625 


1-125 

1-061 

2 

2-122 

1-126 


1-260 

1-118 

4 

4-472 

1-260 


1-376 

M72 

1 

M72 

1-260 


1-500 

1-225 

1 

1-225 

2-126 


1-625 

1-272 

1 

1-272 

1-125 


1-760 

1-323 

— 

— 

1-600 


1-870 

1-469 

— 

— 

1-260 


2-000 

1-414 

1 

1-414 

2-000 


2-126 

1-467 

1 

1-457 





16 

16-714 



M = 
V.? 

16-714/16 = 

1-114 


From cols. 11, IV 

M 

11 

19-125 




and 


19-126/15 = 

1-276 


Now since 

M, = 

15 



sjx 

S/y (-s/iY 
~ 15 

-(V 

1-276 - 1-2-t = 0-035, 


and 

(7 = 

0-187 









Expected values : 

(t“ = 
V-r 

0-03126, o-_ 
' xls 

^ 0*177 



Similarly, for sample means of non-randtm block? of 8 : 


M- = 1-276 M = MO 

o." = 1-275 ~ 1-21 = 0-066 

fJs 

o- = 0-266 

s/W 


( 012 ) 


IGS 


12 
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The random chart, corresponding to a record of quality 
for a product improving and deteriorating by jumps which, 
according to the production engineer, must be considered 
“ unaccountable ” because of the complexity of interacting 
factors, and which, therefore, should not suggest the need for 
altering the machine or the operation, shows none of the averages 
falling beyond even the inner limits. The level of production 
seems maintained, as the same pair of limits serves for the whole. 

The non-random chart, on the other hand, has three out- 
liers : one from the best lot (D, 3 per cent, defective), which 
practically cannot be regarded as a “ wild,” since it occurs 
beyond the lower limit, and two from the worst (A, 10 per cent, 
defective), thus pointing clearly to where the trouble lies. The 
trend of quality is now quite unmistakable. Again, we have the 
advantage of being able to work to one and the same pair of 
statistical control limits throughout. 

We are not any longer misled by the limits changing on 
account of sample quality only, and are thus in a position to 
differentiate between such ephemeral changes of sample quality 
and the more permanent changes in lot quality. 

The limits on these charts were obtained from the constant 
standard error of the mean in samples of 8 in a simple — that is, 
undisturbed — Poisson distribution. That quantity is given by 

VJ X |- = O' 177. The observed standard deviation for the 
distribution of chart 6.8 (random sequence of averages) 

results as V0'035 = 0-187, which is sufficiently close to the 
theoretical value to justify our assumption of a stable process. 

The standard deviation of the s/x distribution of chart 
6.9 (non-random sequence of averages) is VO-OGS = 0*256, 
indicating disturbance, and thus confirming the conclusions 
reached by the control chart method. The control limits are 
the same as in the random sequence chart, because they are 

obtained from the constant value of the variance of the Vj 
distribution for averages of 8 samples. 

It emerges from these examples that the 1/40 limits are 
more sensitive to the distinction between stabihty and dis- 
turbance than the l/IOOO Hmits if we work with averages of 
samples of «' specimens. 
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The control limits being always at the same distance from 
the mean, it goes without saying that the question of economic 
control, tlrat is, whether the quality of the product is as desired, 
musj be decided solely by the position of the mean. 

For the purpose of routine examination we cannot wait 
until we know the average for the whole, or a very long, period 
of manufacture. This would defeat the object of a synchronized 
check on the quality of the article as it leaves the production 
line. We therefore use in this case, to start with, as the mean 
from which to set out the control limits, the best attainable 
average quality of the characteristic in question, expressed as 
a number defectiveness. 

6,11 Technique for Qualitative Charts if No Defectives are to be 
Tolerated * 

(1) Another important difference between any type of 
qualitative chart and the quantitative one is this : working 
to control limits on a quantitative chart will, provided the 
limits are in the position required by specifications, result in a 
product free from rejects. With qualitative charts, on the 
other hand, no matter where the limits are which we obtain 
from the observations (excepting the case of the 1/1000 limit 
at zero), there will be a certain percentage of defective work 
included in the bulk of the product. 

The reason is that only in the case where the average 
defectiveness in samples of n' is zero can we expect limits 
at 0, but whenever the average defectiveness in such samples 
is ever so small a fraction above 0, there will be 1/40 and 
1/1000 limits at points of the scale appreciably higher than 
zero. This means that samples with a defectiveness not exceed- 
ing such limits will not lead to machine adjustments or other 
corrective measures. The result, then, will be the inclusion 
of a certain number of defective articles in the total product. 

To summarize : limits other than at zero are essential for 
stabilizing or controlling the manufacturing process, and such 
limits imply the acceptance of a certain percentage of defective 
work. 

(2) If it were essential, hoWever, that no defective work 

* see Dudding and Jennett, Qualilji Cviarol Chart Technique when Manujacimng 
to Specifmlions, 1944 
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should leave the factory, then it would be necessary, and under 
certain conditions possible, to modify the basic method of 
control so as to achieve that aim. But here we must distinguish 
between low precision and high precision work. The first type 
includes processes with a high variability, the second processes 
with a small variability. 

(a) For the first type there is no other way but to work 
to the quality control chart limits, either for single samples 
of n' or for averages of a number (n) of samples of n', keeping 
in mind a target of a very small percentage defectiveness (see 
paragraph 6.7), with subsequent 100 per cent, insijection and 
clearing out of defectives. 

(b) If, however, the process was such that rejects could 
be avoided altogether provided sufficient care was taken, and 
if the characteristic was measurable, the following modification 
of the standard method should be adopted. 

Its principle is this : limits or gauges are respectively 
set or made more severe than specifications demand. For 
instance, we make ffie size of limits or gauges such as to cause 
10 per cent, of the satisfactory product according to the speci- 
fications not to pass the gauge or limit. The control lines are 
then obtained for such a mean “ defectiveness ” per sample of 
n' specimen as would coiTespond to 10 per cent, defectiveness 

in the total ; that is, we expect a sample of n' to contain — 

“ defectives,” admitting at the same time deviations from that 
mean value up the to conti'ol lines in decreasing proportions. 
The term “ defective ” is here understood as referring to the 
limits or gauges of increased severity, and not with regard to 
the specifications. 

Working to these control limits, we can be satisfied that a 
stable process will mean that no part of the product is beyond 
specifications, 

(3) To fix our ideas, let us think of the length of a certain 
article, say, metal rods, for which an upper tolerance — 1 „ — 
and a lower tolerance — ti — are given, equidistant from a 
nominal dimension which we call the specification mean (M). 
Let us further suppose, in accordance vrith (6), that the process 
is capable of high precision work, that iS) that the specifica- 
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tions can definitely be met in mass production, and that the 
characteristic is measurable. 

We must first determine the size of the gauge or the limits to 
which to work (working limits in contradistinction to specifica- 
tion limits). We assume that the measurements will be dis- 
tributed in accordance with the Normal Law, one tail repre- 
senting “high” and the other “low” values. We know 
that in consequence of this (see Appendix I) the highest 10 
per cent, of the product will lie beyond -)- l’28ff and the 
lowest 10 per cent, beyond — l-28crfrom tlie specification mean. 


But, l'28ff = ^ 
dn 


1-28 

2 ^ 




= -62 Wi. 


a)„ which, for samples of, say, 4, becomes 


[Wi, or any other w^, is determined empirically in the usual 
way as the average range of, say, 20 to 50 samples of 4, or 
generally of n. This c[uantity must be frequently checked, 
and the working limits altered if a significant change in 
has occurred. It is obvious that gauges will be made according 
to severer limits only if one is satisfied that remains, on the 

whole, constant and is equal to or less than 


Setting out ± ■62zei4 from the specification mean, then, 
gives the limits — and the dimensions of the gauge — ^which will 
“reject,” or not pass, 10 per cent, of the product on the high 
and 10 per cent, on the low side ij the total was within specifications. 

Working with samples of, say, 20 specimens, we now expect 
10 per cent, defectiveness in the bulk according to our working 
limits (and gauges) for either aspect of length, and therefore 
2 in 20 not to pass the “ Go,” and 2 in 20 not to pass the “ No 
Go ” part of the gauge. 

We have to keep two charts, one for high, the other for low 
defectives, and examine each sample of 20 witli regard to both 
characteristics. The sample quality is then plotted on the 
respective charts. 

The 1/40 and 1/1000 limits for a mean defectiveness of 
2 are, according to the Poisson summation, 6 and 7 respectively. 
These limits are put on each chart and remain the same through- 
out the manufacturing process. It is now these limits to whicli 
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we Work, and wc can be confident that a stable process as 
governed by these limits will result in 100 per cent, perfect 
product, that is, product free from real rejects. 

(4) For a process of very high precision, that is, with a 
practically constant and very small standard deviation, we can 
allow our safety limit to come nearer to the tolerances without 
incurring the risk of turning out bad work. The case is re- 
miniscent of the modified technique for quantitative charts 
(see paragraph 2.11). The range zti„ being smaller than it 
need be according to the distance — ij, we can allow the 
mean to vary more than would be accountable by mere 
fluctuations of sampling. This we achieve by setting in from the 
tolerartces the distance corresponding to the which resulted 
from our observations. 

We know again that the lowest, 10 per cent, say, will lie 
beyond — 1'28 (t,, from the extreme mean on the low, and the 
highest 10 per cent, beyond -f l*28ojfrom the extreme mean on 
the high side. We do not know, however, the exact position of 
these extreme means. All we know is that they must be 3-09ff« 
distant from the respective tolerances if the product is still to 
be within specifications. That distance from the tolerance 
can then be obtained in the following way : the difference 
3-09crj — l’28crj = I'SICTj represents the distance expressed as 
a multiple of nearer than which no observation should 
approach a tolerance. 


But l-Slc, 

or, for samples of, say, 4 : 


1-81 




Wj == -88 w.. 

2-06 * 


This quantity is now set in from each tolerance to give us 
the new limits or the new gauge dimensions. For each length 
aspect, high and low, we can again expect 10 per cent, “defec- 
tiveness ” in the bulk according to these limits, and for samples 
of 20 we obtain the 1/40 and I/IOOO limits on each chart again 
as 6 and 7 respectively. These limits are then worked to in 
the same way as in the previous case, and with the same 
consequences. 
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" PROBABILITY OF CAUSES— STATISTICAL 
SAMPLING INSPECTION 

7.1 Judging Lot from Sample Quality : How Not To Do It 

The sections on Quality Control proper — Chapters II and VI 
— were concerned with controlling die quality of the product 
during the process of manufacture. In the chapter on pro- 
bability graphs. Chapter III, a method was given specially 
suited for examining incoming components, or for examining 
the finished product, if quantitative measuring of the product 
is feasible and advisable. As a rule, however, the inspection 
of the incoming components or the finished product is done in 
another and more accurate way. 

As we know from Chapter VI, for measurable, or at least 
gaugeable, characteristics hand- or machine-gauges are used 
which will pass the product with regard to a certain character- 
istic only if the dimension of that characteristic is as the specifi- 
cation requires it to be. As a rule, the product must not be so 
great as not to fit into the gauge, and it must not be so small as 
to fit into the “ forbidden ” part of the gauge. For visual or 
functioning defects the product is subjected to a close visual 
scrutiny or to the “ supreme ” functioning test. 

The method in use at present in industrial work consists in 
taking a sample of an arbitrary size (provided 100 per cent, 
testing is not made the rule), putting each item through 
the gauge, or otherwise ascertaining its quality, and taking 
the percentage of the failures in the sample to represent the 
percentage of failures in the lot. Let us assume that the lot 
contains 100,000 pieces, and that a sample of 50 is picked out 
and put through the gauge. If in the sample of 50, say, 1 
defective is found, it is then reasoned that the lot of 100,000 is 
2 per cent, defective. 

7.2 Definiteness v. Correctness of Estimate 

It is quite evident that this method ignores the influence 
of chance upon the sample quality. The smaller the sample 

m 
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size, the less likely is it that the sample quality represents the 

true lot quality. 

Let us think of a great number of different lots of one and 
the same article, whose quality — as judged by the number of 
defectives — will naturally somewhat differ from one another. 
Let us further assume that samples of 100 are picked out from 
each lot, and the sample quality taken to represent exactly the 
lot quality. It can be safely stated that the conclusions reached 
in this way as to the quality of the respective lots will all be 
wrong, or at least the majority of them. This is the method 
which at present is actually in use in industrial sampling 
inspection. 

But if, on the one hand, we must not identify sample with 
lot quality, so, on the other, it would not be of much use if we 
Were to set out to find a method giving us complete certainty 
as to the lot quality if judged by that of a small sample. When- 
ever we judge lot quality from sample quality, we cannot do so 
with 100 per cent, confidence. If we wanted 100 per cent, 
confidence we would have to say that all these different lots 
may simply be anything between 0 to 100 per cent, defective, 
or very nearly so. Now, an answer of this kind, although 
theoretically quite correct, would be just as useless in practical 
work as the answer which is usually obtained is incorrect. 
In brief, the higher the correctness which is to be attained, 
tile more details must be sacrificed, but this must not go so far 
as to render the answer useless. 

All this points clearly to the desirability of having a method 
of obtaining the answer as to lot quality with a certain accept- 
able degree of probability. Is there such a method ? It is 
clear that a positive answer can only be expected if we know 
the Law of Probability which connects lot and sample quality. 
In fact, we know that law, and we can derive from it the 
an.swcr that if we pick a sample of a certain size from a lot of 
a gjven size and find so and so many defectives in the sample, 
then the probability is so and so much that the lot is of such 
and such quality. We shall later use a more cautious word- 
ing so as not to clash with certain objections raised against 
what ‘ is ,called “ Inverse Probability ” — arguing from the 
sample to the population» 
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7.3 Demonstrating the Way in which Small Sample Quality will 
Deviate from that of the Lot 

Before wc consider the application of this law to industrial 
measurements, let us get a general idea of what it is like. An 
apparatus was constructed in the Ballistic Research Laboratory 
of the American Ordnance Department to demonstrate that 
small samples showing a better quality than the lot of moderate defective- 
ness will be more frequent than samples showing a worse quality. 

To demonstrate this fact visually, a machine was constructed 
of which a sketch is given on page 17 4. The working of the 
machine is as follows. “ The hopper of the machine is loaded 
with 900 copper-plated steel balls and lOO chromium-plated steel 
balls (the latter being regarded as defectives). The adjustable 
mechanism on the front of the machine is set for sample size 10, 
and a spectator is invited to operate the machine. The steel 
balls are continually agitated during sampling. The samples 
of 10 flow into the slot shown on the face of the machine. The 
operator can tell at a glance whether the sample is better than 
the lot (perfect), the same as the lot (1 defective in 10), or poorer 
than the lot (more than 1 defective in 10), By rotating the 
sampling disc, samples which are better than the lot are caused 
to flow into the right-hand test tube ; samples which are the 
same as the lot into the middle test tube ; and samples which 
are poorer than the lot into the left-hand test tube. The 
picture shows a typical run. Results are subject to sampling 
fluctuations, of course, and seldom are 36 per cent., 39 per cent,, 
and 26 per cent. * ; but out of a great many runs, there has never 
been (and incidentally could not be) a case in which samples 
poorer than the lot exceeded samples better than the lot.” 
(Simon, An Engineer’s Manual of Statistical Methods, 1941.) 

7.4 The Sernoullian Law of Chance: One “Cause” May Have 
Different “ Effects ” 

In this case, only three qualities of sample were considered, 
but as a matter of fact, a sample of 10 as used in this case may 
have anything from 0 to 10 defectives, and we thus get 11 
groups of sample quality. The distribution of these samples will 

* This is the theoretical proportion of samples which are better than the lot, 
like the lot, and worse than the lot, respectively 
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Apparatus described by L. B. Simon ia An -Engineer’s Manual of Slaiislual 
Methods (New York, ig^i) 
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be roughly as follows ; a high percentage will be just like the 
lot, a greater number of them will be slightly better than lot 
quality, and a somewhat smaller number slightly worse than 
lot OjUality. As the samples get increasingly worse or better 
than the lot quality they become correspondingly less frequent. 
This causes a distribution shaped very much like the one we 
know already — ^the normal distribution. It is sometimes called 
after its discoverer the Bernoullian Law, and after its mathe- 
matical structure the Binomial Law.* 

Let us now see what the law is like numerically. The 
following experiment can easily be made. 

We mix white and black steel balls having J in. diameter 
in very great quantity, and make the mixture contain 60 per 
cent, white and 60 per cent, black. From that lot let us pick 
samples of 4. These sets of 4 can be of the following kinds : 

4 white. 

3 ,, 1 black 


4 black 


We may thus tliink of the lot quality as being a “ cause ” 
which can have not one but five different effects. 

An experiment was carried out by picking 100 samples of 
4 from the lot, and the following figures were obtained : 

4 white 4 

3 „ 1 black 30 

2 „ 2 38 

1 » 3 „ 20 

4 black 8 


* If in the expression {p -t })", called a Binomial, p stands for the proportion 
of “ .successes,” j = 1 — ^ for the proportion of “ failures ” in the lot, and n 
for the sample size, then the probabilities of samples of n containing 0, 1, 2, 
. , . n “ failures ” are given by the successive terms of the expansion of the 

Binomial, p" -f np’'~^ q + ’” 7 , + ■ • • + 9 ". 
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A second experiment gave this series : 
4 white 

3 „ 1 black 

2 2 
“ 3) ^ ii * 

1 „ 3 „ . 


4 black 


20 

46 

28 

4 


It is possible to calculate the frequency of each type of 
sample by means of the law referred to above,* and we then 
obtain the following smooth symmetrical series : 


4 white 

3 „ 

2 „ 

1 u 

4 black 


1 black 

2 „ . 

3 „ . 


6 

25 
38 

26 
C 


Although there are deviations between our observed series 
and the theoretical one, they are not so great as to obliterate the 
regularity due to the law, and they can easily be accounted 
for by 

(a) the small number of samples (only 100). Had we taken 
about 10,000 samples of 4 we could have got a very good 
approximation to the theoretical series ; and 

(b) by slightly irregular shapes of the steel balls, 'and possibly 
also by the arrangement which prevents complete random- 
ness of sampling. 

Now let us change the “ cause ” by mixing 40 per cent, blacks 
with 60 per cent, whites, and again picking a great number of 
samples of 4 and aiTanging them according to their quality. 
We then obtain by theory the following series, which can easily 
be checked by a suitable experiment : 


4 white 

« . 





. 13 

3 „ 

1 black 





. 36 

2 „ 

2 „ . 



• 


. 36 

1 » 

3 „ , 



. 


. 15 

4 black 

. 



. 


2 


* lQ0(i + 4) * ® 100(/g. + + fs + ^ + f (j) == 6-28 + 25 + 37'S + 25 + 6.25 
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Wc sec that the percentage of the different sample qualities 
has changed, but the sample qualities as such have remained 
the same. We have again tire same five types of sample quality 
or ” effects ” as above. 

7.5 One “Effect” May Have Different “Causes” — The Probability 
of Causes 

It is now easy to sec how difficult it is to judge from any 
sample quality what the lot quality is like. 

Let us pick out the sample quality 3 white, 1 black. If we 
obtained such a sample from a given lot, the lot could either 
be our first 60/60 lot or our second 40/60 lot, in fact, it could 
have come from a lot of any quality except one containing only 
whites or only blacks. 

Just as one “cause” could have five different “effects,” so 
one effect could have been produced by any of a great number 
of causes. What distinguishes all tlicse possibilities from one 
another is only the different probabilities for a sample of a 
given quality to have come from lots of different quality. In 
our case, a sample of 3 white, 1 black, has the probability ‘26 
of having come from our first lot, and the probability -SS of 
having come from our second lot. 

This shows already the way towards solving our initial 
problem, namely, how to infer from a sample of a given size 
and a given quality the quality of the lot, with a certain 
acceptable amount of probability. From the quality of the 
sample we can determine the probability that a certain 
“ cause ” will have given rise to such a sample, even if 
wc cannot directly estimate the probability of that " cause,” 
Generally speaking the probabifrty of a sample quality 
arising out of the various / possible lot qualities will be a 
maximum for lots whose percentage defectiveness is nearly the 
same as that of the given sample. This fact forms the basis for 
judging lot quality from sample quality as it is generally prac- 
ticed. Nothing, however, justifies the expectation that the 
random sample taken should be just one of the most probable 
groups of samples. We neglect in this way the spread of sample' 
qualities, or for a given sample, the spread of lot qualities that 
could give rise to such a sample. To make our estimate more 
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true to facts the statement about the most probable “ cause ” 
should be supplemented by taking the spread of quality into 
account, and stating the range of lot quality from which with a 
high probability our sample could have arisen. 

7.6 Simon’s Charts — Description and Instructions for Their Use 

If we had to carry out calculations of that kind it would take 
up too much of omr time, and would also be too difficult, but 
fortunately the information which we want has been condensed 
into the form of charts. These charts have a twofold purpose. 
If we know the quality of the lot we can, from these charts, read 
quickly the range of quality we must expect in samples of a given 
siae. On the other hand, if we know the quality of a sample of 
a given size we are able to infer from it the lot quality within 
certain limits, i.e. the range of the lots of different quality which 
with a certain stated amount of probability would give rise to 
such a sample quality. 

Let us assume a lot of articles is inspected for a certain 
characteristic by taking a number of samples, say several 
hundred, and counting the number of defects in them — that 
is, the number of items in which the characteristic in question 
does not come up to specification. 

To fix our ideas, let us think of a sample of 300 shell fuses, 
and let us assume that 6 defectives — that is, 6 fuses which did 
not fire — ^were found in it. 

As a rule, and without using modern statistical methods, one 
Would conclude that if a sample of 300 contains 6 defectives, 
and is therefore 2 per cent, defective, the lot must also be 2 per 
cent, defective. But this way of estimating the lot quality is 
quite wrong. It has already been mentioned that if we draw 
conclusions from sample quality as to lot quality, and if we want 
to come near the truth, the more definite and unequivocal we 
make our estimate, the further we shall be from the truth. If 
we want to get a really reliable estimate of the lot quality as 
judged from a comparatively small sample, we can do so only 
with a certain amount of probability. This is what Simon’s 
charts * help us to do (see L. E. Simon, An Engineer’s Manual of 
Statistical Method, New York, 1941). 

* They arc based upon a complicated function derived from the Bernoullian 
Law, arid called the Incomplete jS»Function Ratio 
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{a) Inference of lot quality fom sample . — There are five charts 
from each of which to draw conclusions about the relation of 
lot and sample quality with different amounts of probability. 
We s,tart with the chart marked “ Iq = -5.” The horizontal 
scale gives the sample size, the vertical scale gives the lot quality, 
and the curves represent the number of defectives in a sample. 
Entering the chart with sample size 300, and projecting the 
crossing-point with the curve for 0 defeedves to the left, we 
read a defectiveness of 2-2 per cent. This is in harmony with 
the fact pointed out above, that in comparatively small samples 
tire lot quality will always appear a little better than it really is. 
It is important to realize that it is only with 60 per cent, prob- 
ability that we can draw this conclusion about the lot being 
2-2 per cent, defective. In other words, the chances are even of 
its being better or worse. But just because of this element of 
uncertainty, it comes much nearer to the truth than simply 
making the assumption that the lot quality is identical with the 
sample quality. 

According to what was said above, if we want to come 
nearer to certainty, we must sacrifice some of the definiteness 
of information and accept a range of lot qualities, with regard 
to percentage defectiveness, within which the true value will lie. 
Such a range we obtain from the charts marked “ Iq = •! ” 
and “ Iq = -9.” The general construction of the charts cor- 
responds to that already explained for chart Iq = -6. 

We enter chart " Iq = T ” witlr sample size 300, and 
projecting the crossing-point with the curve for 6 defectives 
to the left, we read a defectiveness of -013 or l-S per cent. 
This means that there is a probability of 90 per cent, that a 
sample of 300 with 6 defectives in it will come from a lot 
which is 1'3 per cent, defective or worse, and, conversely, that 
there is a 10 per cent, probability of the sample coming from 
a lot which is better than 1-3 per cent, defective. 

Going over to chart “ Iq = -9 ” and proceeding in the same 
way as before — that is, entering the chart with sample size 300, 
projecting the crossing-point with the curve for 6 + 1 {on this 
chart !) defectives to the left, we read -036 or 3'6 per cent. 

* Appendix VII 
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defectiveness of the lot. This means there is a probability of 
90 per cent, that a sample of this size and quality will come 
from a lot that is not worse than approximately 3'6 per cent, 
defective ; and conversely that there is a probability of 10 per 
cent, that the lot is worse. 

It we wanted to obtain still higher certainty — ^that is, if 
we are not even satisfied with drawing our conclusion with 
90 per cent, probability — ^we have to use the chart marked 
“ ly = -005 ” and “ Iq — -995,” proceeding in the same way 
as before. We read on the chart “ Iq = -005” for a sample 
of 300 with 6 defectives in it, -0095 — ^that is, less than 1 per cent, 
defectiveness. Tfris means that with a probability of 99-5 per 
cent, we can say that a sample of this size and quality will 
come from a lot which is worse tlian 1 per cent, defective ; and 
conversely that there is only a probability of ^ per cent, that 
it will come from a lot much less than 1 per cent, defective. 

On the chart Iq = -995, proceeding in the same way as 
before, we read a lot quality of 5 per cent, defectiveness. This 
means that with a probability of 99-6 per cent, a sample of this 
size and quality will come from a lot that is no worse than 
6 per cent, defective ; and conversely that there is only per 
cent, probability that the lot will be worse. 

We need not, however, draw our conclusions with such high 
probability as given by the last two charts. For practical 
purposes it is quite sufficient to work only with the -6, T, and -9 
charts, and we then arrive for our sample at the conclusion that 
the lot quality is most probably 2-2 per cent, defective, with a 
possible range of from 1-3 to 3-6 per cent. (The probability 
of our sample coming from a lot of between 1-3 per cent, and 
3- 5 per cent, defectiveness is 80 per cent., according to how 
two probabilities are combined, viz. -90 x -90 = •81.) 

This information, although less definite than an absolute 
figure, is much more reliable and will, in the long run, fully 
justify itself. 

{b) Inference of range of sample quality from lot defectiveness . — 
Tfre charts are suitable for providing a second kind of in- 
formation, which can be used for constructing a sort of quality 
control chart, 

Supposing specifications were laid down for a certain 
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characteristic to the eficct that a lot must not be worse than a 
stated amount of defectiveness with regard to that character- 
istic. To fix our ideas let us again work with a numerical 
example, and let us assume now that 2 per cent, defectiveness 
was the limit of acceptance with regard to that characteristic. 

If we are faced with having to judge lot quality from sample 
quality, it is quite evident that the number of defectives for 
which to allow in the sample with regard to that acceptable 
amount of defectiveness of 2 per cent, will vary with the sample 
size, but the way in which they vary is not so simple as is usually 
assumed. We might be tempted to say that if the defectiveness 
in the lot should not be worse than 2 per cent., and if we were 
to take samples of 100, we might allow up to 2 defectives — or, 
in samples of 200, up to 4 defectives — but this would be wrong. 
We must again take the chances of sampling fully into account. 

Let us assume samples of 400 were taken. How much 
should we allow them to differ in quality without suspecting 
the lot quality to be worse than 2 per cent. ? This is the funda- 
mental problem. 

Taking again the charts “ Iq = <1 ” and “ Iq = -9,” we 
can read these charts in an inverse way compared with what 
we did before. Wc again enter the chart “ Iq = -9 ” with the 
sample size — this time 400 — ^but now we determine the crossing- 
point with the horizontal line erected at the acceptable lot- 
defectiveness — that is, at -02, and the nearest curve to that 
crossing-point will give us the number of defectives to accept as 
a lower limit. In the given case we read 5 defectives. 

This means that in a lot of 2 per cent, defectiveness we can, 
with 90 per cent, certainty, expect a sample of 400 to contain 
no less than 6 defectives ; and conversely that with a prob- 
ability of 10 per cent, we can expect less than 5 defectives. 

To obtain the upper limit of defectiveness we go to chart 
“ Iq = -1,” and there we read for a sample size of 400 and 
lot defectiveness -02 an upper limit of sample defectives of 11. 

This means that, with a certainty of 90 per cent., we can 
expect that a sample of 400 picked at random from a lot 
of 2 per cent, defectiveness will not contain more than 11 
defectives ; and conversely that there is only a probability of 
10 per cent, that it will contain more than 11 defectives. 

I g 
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(c) Control charts for sampling inspection . — ^This information 
can now be used for constructing a quality control chart for suc- 
cessive lot qualities. Let us assume we have sampled lot after 
lot by taking samples of 400 from each, counting the defectives 
with regard to our characteristic, and that the following series 
of sample qualities was obtained : 

7, 8, 8, 5, 10, 8, 9, 10, 6, 6, 13, 11, 8, 8. 

We could, of course, without any transformation into a 
control chart, see at once from Simon’s chart that all samples 
except the one with 13 defectives could have arisen with a 
sufficiently high probability out of a lot which was no worse 
than the stated defectiveness of 2 per cent. 



Fiu. 7.2 

Quality Control Chart for judging Stability or Otheiwisc of Lot Quality 

There are, however, certain advantages in plotting the 
results in the form of a chart, especially as it enables one to 
discover trends which otlrerwise might escape attention. 

To consti'uct the chart, we mark the sequence of lots on the 
horizontal scale and the quality of the sample on the vertical 
scale. We then draw a line at 5 defecdves as a lower limit, 
a line at 11 defectives as an upper limit, and now simply plot 
the sample quality by putting dots at the crossing-point of the 
ordinates for lot number and .sample quality. We then see that 
sample number 11 points to a quality of the lot worse than 
acceptable. 

This, however, does not mean, in the light of what we said 
above, that the lot must definitely be worse than 2 per cent. 
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It must be kept in niind that we draw our conclusions only with 
a certain amount of probability, however high, A lot outside 
the limits should therefore be subjected to a closer scrutiny 
before a final decision is reached. 

It should be observed that the control lines on this chart cor- 
respond to different probability levels than do the control 
lines on the ordinary control chart. Simon’s charts “I^ = •! ” 
and “ Iq = "9 ” allow us to draw conclusions with 90 per 
cent, probability each, that is, we draw with 90 per cent, 
probability the conclusion that a sample will be not worse than 
a certain amount of defectiveness, and we draw with 90 per 
cent, probability the conclusion that it will be no better than 
a certain amount of defectiveness. If now we want to obtain 
the probability that a sample will simultaneously comply 
with both these requirements, we have to multiply the 
probabilities in order to obtain the combined probability. 
•90 X -90 = '81, approximately 80 per cent. It follows that 
the band on the control chart between our two lines represents 
80 per cent, of our distribution of lot qualities, with 10 per cent, 
of the distribution lying beyond it on either side. 

7.7 Tables Into Which. Simon’s Charts Have Been Condensed 

These charts, however, may not always be available, and 
apart from this they may seem too intricate to the Production 
Man, who is not used to statistical reasoning or to reading 
charts of this kind. 

I have therefore thought it advisable to condense the 
information derivable from them into tables, which allow us 
to read at a glance : 

(1) the most probable lot quality for a given combination of 
sample size and defective number (Table 1, Appendix 
VIII) ; 

(2) the lot quality range within which with a very high prob- 
ability — 80 per cent. — ^the true quality of the lot will lie 
for a given combination of sample size and number of 
defectives (Table 2, Appendix VIII) ; 

(3) the range of sample quality (number of defects)^ we must 
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expect with a very high probability — 80 per cent. — for a 

given combination of lot quality and sample size (Table 3, 

Appendix VIII). 

7.8 Probability Graphs for Qualitative Sampling — Straight Line 
Graph for Poisson Distribution 

The reader may have wondered whether the simple 
method of Probability Graphs, which we discussed in Chapter 
III for quantitative sampling, could not be used for discovering 
something about the “ causes,” in the sense of “ lot quality ” 
of qualitative, or number defective distributions. This is 
in fact the case. But there is this important distinction ; 
whereas with normal probability graph paper stress was laid 
upon the graph providing information about the percentage 
of the product within specifications, and only secondarily 
about the nature of the distr’ibution, it is just the opposite 
with qualitative probability graphs. Their foremost purpose 
is to provide information about the nature or type of the lot 
distribution. 

The graph paper used for this purpose is the Poisson 
Summation Graph, Appendix V (a and b), and the method 
of constructing a particular graph is as follows : 

The frequency distribution for the values of the variable 
0, 1, 2, . . , n, that is, for samples with 0, 1, 2 ... n defects, 1 
is summed, starting with the frequency of the highest (or lowest) 
class interval. This gives a scries called the cumulative series, 
in which, if we started the summing at the highest value of the 
variable, the rth term (counting the frequency at 0 as the first 
term, that at 1 as the second, and so on) gives the frequency 
of samples having a value of r — 1 or more, and if we started 
at the lowest value of the variable, a series in which the rth 
term gives the frequency of samples having a value of r— 1 
or less defectives 

Reducing each integi'al frequency thus obtained to a total 
number of 100 samples gives tlie cumulative percentage series. 
These frequencies are now plotted on Poisson probability 
graph paper * at tire crossing points of the horizontal at the 

Appendix Va, if svmniing from highest value. Appendix Vb, if summing 
from lowest value 
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corresponding percentage (vertical scale) with the curve for 
the corresponding value of the variable. 

The series need not be shifted by half an interval as in 
the i^ase of quantitative distributions, because the scale of the 
variable is here discontinuous, and the distribution, therefore, 
not a grouped distribution. 

If now the parent or lot distribution was of the Poisson 
type, and this it will be if the defectiveness is small (less than 
10 per cent.), the crossing-points so determined will lie more or 
less on a vertical erected at the distribution mean. For example, 
let us assume that 100 samples of 100 articles each, say metal 
rods, were inspected with regard to whether the length of 
each rod did or did not exceed a specified amount, and were 
arranged according to sample quality as follows : 


Table 7.1 


No. of defects 
in samples 
of 100 

Frequency 

of 

samples 

Cumulative 

Percentage 

series 

0 

6 

100 

1 

1.5 

05 

2 

22 

80 

3 

23 

68 

4 

17 

3,5 

5 

10 

18 

(3 

5 

8 

7 

2 

3 

8 

1 

1 


Mean = 3 <r = Fs 


As the graph of the per cent, series shows (Fig. 7.3), the 
distribution of points is very well fitted by the vertical erected 
at the mean. From this we infer that the probability of a 
defective article is fairly constant, and that the lot can be 
regarded as being approximately 3 per cent, defective. 

Looking at the graph, we see at a glance that an increase 
in sample size (say 150 instead of 100), will have no other 
effect than to shift the vertical line to the right, because in 
spite of an increase in mean defectiveness per sample, the 
charaetpr of the distribution has remained the same. 
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If ovir samples yield a distribution of points which can be 
satisfactorily fitted by such a vertical line, the type of control 
chart suitable for watching the process of manufacture is the 
usual type of number defective chart which, as we knoyv, is 
based upon the Poisson Summation Function (paragraph 6.3). 

7.9 Straight Line Graph for Binornial Distribution 

The distribution of number defectiveness need not, how- 
ever, follow the Poisson Law. It will not do so, for instance, if 
the defectiveness is comparatively high (greater than 10 per 
cent.), and the sample size small. 

Let us assume we had inspected 1 00 samples of metal rods 
of 6 specimens each for length, and had found the following 
distribution of samples of 6 containing 0, 1, 2 defec- 

tives, that is, rods of incorrect length. 


Table 1.2 


No. of 
defectives in 
sample-t of 6 

Frequency 
of samples 
of 5 

Cumulative 

percentage 

series 

0 

33 

100 

1 

41 

67 

2 

20 

26 

2 

5 

6 

■i 

1 

1 


Mean = 1 a = 0-9 


As shown in Fig. 7.4 the cumulative percentage series gives 
a straight line inclined to the left. This is the typical Poisson 
probability graph for the Binomial (p + 9)", which, for our 
illustration becomes (0-8 -f 0-2)®. The probabilities of samples 
of ?2 (6 in our case) of different qualify are then given by the 
successive terms in the expansion of that Binomial. 

If We Wanted to control the process of manufacture, giving 
rise to a line inclined towards the left, by means of quality 
control charts, then the 1/40 and 1/1000 Hmits should not be 
calculated according to the P oisson Law a s i 2 VM and ± 3 VM 

respectively, but as ± and ± 3 

respectively (see paragraph 6.6). 
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Fio. 7.3 

Poisson Distribution far Samples of 100 in 3 per cent. Defective Bulk (Table 7.1) 

FiO. 7,4 

Binomial (0-8 + 0-2)® (Table 7,2) 

7.10 Straight-Line Graphs for Compound Distributions 

There is yet another typical probability graph which is of 
great importance. 

A line inclined towards the Irft will as a rule result from 
machined characteristics, such as length or diameter of single 
components, if the defectiveness is greater than 10 per cent, 
and the sample size small. A vertical line wiU arise either 
from simple machined characteristics or for characteristics of 
assemblies of components, provided the defectiveness is small 
(less than 10 per cent.). If, however, the several machines, on 
which an article is turned out, differ significantly in precision 
with regard to the characteristic in question, or if the results 
of assembly processes are examined with regard to characteristics 
into which enter the qualities of various components, and if the 
percentage defectiveness is high, then the hne which can be 
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fitted to the scatter of points on the Poisson paper will often 
be found to incline towards the light. 

The starting-point of the Theory of Repeated Events or 
Multiple Happenings was, as we know, the statistical investiga- 
tion of accidents, and it will therefoi’e not be surprising that the 
extension which we are now about to discuss was also first 
observed and studied in connection with accidents. At the 
basis of this extension lies the idea that accidents are not only 
just accidents, but depend also to some extent upon the 
individual proneness to accidents of a certain type. 'We say 
colloquially that some people are “ all thumbs,” by which wc 
mean that they are liable to incur accidents more frequently 
than others. The frequency distributions which we meet in 
connection with accidents will, therefore, generally be mixtures 
or compounded distributions consisting of partial distributions 
around different means. 

The following illustration is based upon an observation of 
647 women workers in a munition factory during the Great 
'War of 1914-18, with regard to industrial accidents.* 


T.'iBLE 7.3 


No. of 
accident.^ 

Observed 

frequency 

Cumulative 

series 

Cumulative 

% 

0 

447 

647 

100 

1 

132 

200 

31 

2 

42 

68 

11 

3 

21 

26 

4 

4 

.3 

0 

1 

5 

2 

2 

— 


Proceeding in the same way as before, that is, forming the 
cumulative per cent, series and plotting the results on prob- 
ability graph paper, we find that the points can be satisfactorily 
fitted by a straight line inclined towards the right (Fig. 7.6fl). 
Another distribution of this type is repre.sented by the graph, 
Fig. 7.6^. 

In industrial work, as observed above, we meet with this 
type of graph when the characteristic in que,stion is of a complex 

^ * YuJe and Greenwood (1920), J, Ray. Stat, Soc., 83, 255 
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nature depending upon the qualities of several components 
plus the operation of the machine. 

This is, for instance, true of the form of completed cartridges 
or “rounds.” A run of 188 samples of 50 specimens each, 
inspected with regard to whether they fitted into the form 
gauge or not, gave the following distribution : 


Table 7.4 


No. of 
defectives in 
samples of 50 

Frequency 
of samples 
observed 

Gumulativf ’ 

scries 

Cumulative 

% 

0 

133 

188 

100 

1 

39 

5.5 

29 

2 

9 

16 

9 

3 

6 

7 

4 

4 

2 

2 

1 


The cumulative percentage series plotted as before can 
again be fitted by a straight line inclined towards the right 
(Fig. 7.5r), which by chance is very close to that obtained from 
the accident distribution in Fig. 7.6a. 

It is well worth noting that the differences between the 
pictures of our three typical lines has its exact parallel in the 
statistical differences between the three distributions. The 
line inclined towards the left represents a simple Binomial — that is, 
a population where defectives have a constant probability — 
q — and, therefore, successes a constant probability— The 
vertical line is the geometrical picture of a distribution which, 
although giving the outward appearance of a homogeneous 
distribution, that is, with constant p and q, is really a mixture 
of a number of distributions Avhose differences in means, 
however, are so small that they are swamped by the comp)ara- 
tively great variability within each distribution. And finally, 
the line inclined towards the right represents a mixture of distribu- 
tions with significantly different means, that is, with differences 
between the various component distributions so great as to 
outweigh the variations within these distributions.* 

* Note for the mathematical reader : This line is also algebraically the 
reverse of the line inclined towards the left because it can be interpreted as re- 
presenting the Binomial with negative index. For instance, Fig, l.Bd is the 
probability graph for the expression 0-7* (1 — 0-3) 





Fro. 7.6 

Probability Graphs of Various Compound Poisson Distributions 

^.5a Compound accident distribution M = -47, a = -83 (Table 7.3) 

7.65 Compound accident distribution M = 4'86, a = S-1 
7.5c Compound industrial distribution (Table 7.4) 

‘7M Binomial with negative index 0-73(1 — 0-3)~3 

From the interpretation of the three types of straight-line 
graphs it is clear that there will be transitions between the three 
corresponding distributions. The gi'aphs will in such cases 
often appear as lines curved to the right or to the left, and be 
of more or less irregular shape.* 

Processes yielding a graph on Poisson paper, like Figs. 
T.5a-d, cannot be satisfactorily controlled by the usual type 
of chart, and the technique outlined in paragraphs 6.8 to 6.10 
is recommended. 

♦ Yule (1910), J. ftqpi. Slat. Soc., 20 
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7. II Information to be Derived from Such Graphs 

We now come to the practical purpose of such graphs. 
It is 

(1) tp get a picture of the nature of the distribution in the lot, 
especially whether the probability of a defective remains 
sensibly constant, or whether it varies from time to time or 
from machine to machine, or with any other criterion for 
sub-grouping. According to whether the graph is a Une 
inclined towards the left, a vertical line, or a line inclined 
towards the right, we conclude that the probability of a 
defective remains constant, that its variability is negligible, 
or that it is significant ; 

(2) to get an estimate of the defectiveness in the lot from 
which the samples were taken ; 

(3) to enable us to estimate the proportion of samples of a 
given size suffering from a specified amount of defectiveness. 
In order, however, that the graph should give useful 

information of this kind about the lot or population, we must 
first be in a position to know to what extent the sample can be 
identified with the lot with regard to quality. This reduces 
to the question of whether the straight line by which we have 
fitted a given scatter of points represents a “ good fit.” If so, 
both the nature of the lot distribution and tlie percentage 
defectiveness in the lot can be inferred from the graph. The 
problem is slightly different from one type of graph to the other. 

(a) If the points lie more or less along a vertical line, the 
theoretical line is given by the vertical erected at the mean 
defectiveness of our sample. From that line, which represents 
the theoretical cumulative percentage series, we easily obtain 
the theoretical frequency distribution, and we then have two 
frequency distributions to compare with one another, the 
observed and the theoretical percentage series. 

The comparison is carried out according to the Glii-square 
test (see paragraph 3.5), 

[b) If the points lie along a line inclined towards the left, 
the obvious and most satisfactory way for constructing the 
theoretical line is to derive it from the expansion of the Binomial 
in question. For that purpose we would calculate the per- 
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centage defectiveness in our samplcj regard that fraction as q, 
which at once gives p, and we could then set up the Binomial 
with the sample size, n, as index, whose expansion would give 
the successive terms of the theoretical frequency distribution 
(as fractions of unity). We could then proceed to compare 
the theoretical with the observed distribution (reduced to a 
total frequency of 100) by means of the x“-test. 

But this procedure becomes laborious if the index of the 
Binomial is great, and it can therefore not be recommended 
for ordinary routine work. The only alternative is to draw the 
best fitting line to our scatter of points. From that best fitting 
line which represents the cumulative percentage series, we 
obtain easily the frequency disti’ibution and have again two 
distributions, the observed (reduced to 100) and the 
“ theoretical,” to compare with one another according to the 
Chi-square test. 

(c) If the scatter of points is along a line inclined towards 
the right, the satisfactory way of fitting a theoretical distribu- 
tion to that scatter, and hereby finding the correct line repre- 
senting the lot, would be to employ a function called after its 
discoverers the “ Yule-Greenwood Curve,” representing an 
infinitely compounded Poisson distribution. The comparison 
between the observed and the theoretical frequency distribution 
is then again carried out by means of the Chi-square test. As in 
the case of (i), however, the process involves somewhat heavy 
arithmetic, and it is recommended to draw the best fitting line 
free-hand and then proceed as in the alternative case for (i). 



Chapter VIII 


THE* SHORTHAND OF INSPECTION— A RATIONAL 
SAMPLING SCHEME 

8.1 Ascertaining Lot from Sample Quality as Distinct from Limiting 
the Consumer’s and Producer’s Risk — ^The Dodge-Romig 
Sampling Method 

The relation between lot and sample quality has been explained, 
and it has been shown how to make use of it in estimating lot 
quality from sample quality, and vice versa. 

What emerged from this was that we could not have both 
definiteness and correctness of estimate at the same time. If 
we prefer to be correct in the long run, we had better sacrifice 
some of the definiteness and be content with a certain small 
range of lot quality. So long as this is understood and both 
producer and consumer agree — not upon a definite quality, but 
upon a range of such quality — ^there does not seem to be any 
further need for more complicated statistical procedures. 

There is, however, something unsatisfactory about this way 
of making statistics the supreme arbiter for acceptance and 
rejection, and leaving it at that, since it unavoidably results in 
a certain small percentage of lots being wrongly classified. It 
has been rightly said by W. E. Deming : “ There is no use 
in taking data, if you do not intend to do something about the 
source of the data.” 

We have seen how efficient statistics can be during manu- 
facture, by firmly leading a more or less unstable process 
towards a state of control, and the question arises whether 
sampling inspection, too, could do more than " accept ” or 
“ reject.” 

As we shall see in this chapter, the answer is in the affirma- 
tive. It is possible by a judicious weeding out of defective 
work to do for the finished article very much the same as' 
quality control does during the process of manufacture, viz, 
reduce the amount of defectiveness and ensure a stable quality 
in the outgoing product. 

It must be kept in mind that, in using Simon’s charts, 
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attention is primarily fixed upon the quality of the individual 
lots as judged by samples of arbitiaiy size. It is, then, either 
a case of ascertaining the lot quality from the sample quality, 
or of specifying the lot quality and accepting only such a range 
of defects in a sample of arbitrary size as is compatible with 
that pre-determined quality target.* 

Very often, however, we are faced with the problem : 
“ What sample size should one take ? ” What is meant is 
evidently this ; “ What sample size is one to take if, in the 
long lun, the consumer is to get what he wants, without exposing 
the producer to an undue risk of having good stuff rejected?” 
Or briefly : “ What is the most economical sample size com- 
patible with safeguarding both consumer and producer against 
undue risks ? ” 

Schemes of that kind are available. Such a method does 
not primarily fix upon a certain quality of the individual lots, 
but aims at limiting the risks for both the consumer and the 
producer (if lot quality is estimated from that of a comparatively 
small sample) over a long period of time, or for a very great 
number of lots collectively. The standard method with these 
aims in view was developed in the Bell Telephone Company 
laboratories, U.S.A., and is, after its originators, called the 
Dodge-E-omig method. We shah explain it in the following 
paragraphs, using the minimum amount of mathematics com- 
patible with making the matter really understandable to the 
reader. 

Paragraphs 8.6 to 8.8 give illustrations of how the method 
works and will, it is hoped, demonstrate the eminent practical 
advantages to be derived from it. 

8.2 The Consumer’s and Producer’s Risk Explained 

Let us assume that the consumer does not want to run a 
risk greater than 10 per cent, of accepting individual lots of a 
quality worse than a stated amount, and let that stated amount 
be, say, 3 per cent. This is called the lot tolerance per cent, 
defectiveness. What this stipulation on the part of the consumer 
amounts to is drat he definitely wants to make sure that his 

♦ For Simon’s method of dealing •with related iots (” Grand Lot System 
the reader is referred to Chapter YI, loe. dt. 
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risk of accepting a quality worse than 3 per cent, is less than 
1 in 10, and that this risk will become increasingly small as 
the quality gets worse. In order to safeguard the consumer 
in tlys way, the following ideas are used. 

We know that comparatively small samples from a lot of 
3 per cetrt. defectiveness will have a fairly large range of 
defectiveness. According to their defectiveness they will have 
definite probabilities of occurrence. If the risk is to be 10 per 
cent., we ask ourselves what is the maximum defectiveness in 
samples of a certain size, if their number or frequency must 
not amount to more than 10 per cent, of all the samples of 
such size ? Taking that maximum defectiveness as the border- 
line between acceptance and rejection will involve the con- 
sumer in accepting in the long run 10 per cent, of the product 
if it is not worse than 3 per cent, defective. 

Now, it is evident that, if the defectiveness of the product 
increases, the percentage of samples with defects up to the 
number we have determined will become less, and this means 
that the consumer will be effectively protected according to 
his own stipulation, namely, that his risk of accepting a lot of 
a quality worse than 3 per cent, will be less than 10 per cent. 

But this is only one side of the picture. We must also take 
into account the producer’s standpoint, who does not want 
to have too many lots unnecessarily rejected for 100 percent, 
inspection. By “ unnecessarily rejected ” is meant that he 
does not want to have lots rejected for bad quality if the average 
quality of his product has not deteriorated and is, on the whole, 
satisfactory to the consumer. This could happen if the 
comparatively small sample gave an unsatisfactory picture of 
the lot quality. 

It is evident that as tire consumer, for his own protection, 
draws the border-line between acceptance and rejection so as 
to accept only 10 per "cent, of 3 per cent, defective stuff, and so 
rejects 90 per cent, of it, he will reject a certain smaller per- 
centage of 2 per cent, defective, and a certain still smaller 
percentage of 1 per cent, defective lots, although 2 per cent, 
or 1 per cent, defectiveness might he perfectly acceptable to 
him. In this way the producer would suffer by having accept- 
able lots rejected. 
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The question now is how to combine the consumer’s' 
protection with a reasonable safeguard for the producer, so as 
to minimize the latter’s risk of having good lots rejected. 

Generally speaking, if the consumer wants to be protected 
with a certain probability against accepting lots which are 
worse than a stated amount of defectiveness, then the price 
which the producer has to pay is that a certain percentage of 
lots which are not worse than that slated defectiveness, will be 
rejected, the percentage of such rejections getting smaller, the 
better the lot quality. The problem is how to make that 
“price” a minimum. 

How this is achieved by manipulating the sample size will 
now be shown without mathematical formulae. 

8.3 Combining Both Standpoints 

Let us assume that we are dealing with lots of 1000, that 
the consumer wants to be protected against a risk greater than 
10 per cent, of accepdng individual lots which are worse than 
3 per cent, defective, and that the process average defectiveness — 
that is, the average defectiveness of the product during a long 
period of manufacture, is -45 per cent. 

What is wanted is a combination of sample size n and 
number of defects c, so that not only will the consumer be 
protected according to his own wishes, but the producer’s risk 
will be minimized by a plan involving a minimum risk of 
rejection. As every rejected lot is to be 100 per cent, examined 
by the producer and cleared of defectives, this is tantamount 
to a plan involving a minimum amount of inspection. What 
we are aiming at is therefore that the number of lots which 
are wrongly rejected for bad quality, and are therefore to be 
100 per cent, inspected, shall be the smallest compatible with 
the stated consumer’s risk. 

For a given defectiveness in the lot (3 per cent.), we can 
easily arrive at a combination of a and c, so as to guarantee 
that the consumer’s risk of accepting a lot worse than 3 per 
cent, defective will be smaller than 10 per cent. Fpr a sample 
size of, say, 75, we get the mean defectiveness of samples of 
such size by multiplying 75 with the defectiveness in the lot 
— •03~which gives 2-25. With that mean defectiveness 
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we now enter lire Poisson probability chart (Appendix V) 
on the horizontal scale, and read that 10 per cent, of such 
samples will show no defects at all. The consumer will therefore 
accej)t the lot only if a sample of 75 is clear of defectives. 
Proceeding in the same way for n = 125, we read from the 
Poisson probability chart that 10 per cent, of such samples will 
contain 1 or less defectives. For n ~ 170 we proceed the same 
way, and read that there will be 10 per cent, of such samples 
with 2 defectives or less. We thus can get a number of co- 
ordinated n and c, so as to protect the consumer according to 
his stipulation, because if the lot quality gets worse, less than 
10 per cent, of the samples will have a defectiveness equal to 
or lower than c, and .so lead to more frequent rejection. 

The question now is, which of these co-ordinated n and c 
will satisfy the producer by minimizing the risk of 100 per 
cent, inspection ? The answer can again be obtained from 
the chart. In order to know what each of the sampling plans 
in the first and second column of the following table means for 
the producer, we calculate the mean defectiveness of samples 
of the size as stated in column 1 of table 8.1 for a product of 
•46 per cent, process average defectiveness. 

Doing this for the first sample of 75, we obtain the mean 
defectiveness by multiplying 75 by -0046 = •3375, and enter 
the Poisson probability chart with this figure as the mean on 
the horizontal scale. The crossing-point with the curve for 
zero defectiveness projected to the left gives the result that the 
probability of such samples appearing will be -713. Using 
sample size 76 and allowable number of defects 0 as our 
sampling criterion, this means that of 1000 lots we shall accept 
713 and consequently reject 287. These proportions, expressed 
as fractions of 1, represent respectively the probability of 
acceptance and of having to inspect a lot 100 per cent., and 
•287 is therefore the producer’s risk of having acceptable pro- 
duct rejected. 

Proceeding in the same way for the following sample, « == 125 
and c = 1, wc obtain the mean defectiveness per sample of 
126 items in the lot of -46 per cent, defectiveness as -55625. 
Entering the chart with. '566 as the mean defectiveness on the 
horizontal scale and projecting the crossing-point with the curve 

(612) 14. 
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for one defective or less to the left, we read the probability of 
acceptance as >891, .and consequently a probability of rejection 
of •] 09, and similarly for all the other combinations of n and c. 
The results are set out in columns 3 and 4 of the followine' table : 

r 


Table 8.1 * 


n and c combinations 
for lot size 1)000^ 
lot tol. % defec. 3% 
cons, risk OTO 

Application to product having 
process av. % def. — 040% 




prob. of 

Av. No. of pieces 


Allowable 

Prob. of 

inspecting 

inspected per lot 

Sample 

defect 

acceptance 

remainder 




size n 

number 

by samplf' 

of lot 







(prod, risk) 

sample 

remainder 







of lot 

Total 

73 

0 

•713 

■2S7 

73 

205 

— 

125 

1 

■801 

•109 

125 

95 


J-170 

t 2 

t-058 

+ •042 


+ 35 

f'm 

210 

3 

•984 

•010 

210 

13 

223 

250 

4 

•994 


260 

5 

256 

200 

0 

•098 

•002 


1 



0 

■999 + 

•000 + 

325 

0 

325 


In order to determine which sampling plan involves the 
minimum amount of inspection, we proceed in this way. 
We must find out what is the average number of pieces inspected 
per lot of a given size. This average number will be composed 
of the number inspected in the sample plus the average number 
inspected In the remainder of the lot. The number in the 
sample is, of course, identical with the sample size. For the 
first combination of n and c it is 75. From a lot of 1,000 there 
remain, therefore, in the lot 925 pieces. In column 4 -we see that 
of 1000 lots 287 will have to be 100 per cent, inspected. The 
average number of pieces inspected per lot remainder will 

, thereiore be — — , which equals . Ignoring the 

decimals, we obtain 265 as the average number of pieces in- 

Altei" Dodge-Romig, Sampling Inspection Tables, p. 8 
■{■ Plan involvfing rnimmmn amount of inspection 
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spected in the remainder of the lot for this combination. The 
total number inspected is 75 + 266 = 340. 

In the same way we proceed for each combination of n and c. 
The results are entered in columns 6, 6, and 7 of the table. 
We s^e from it that the sampling plan n = 170, r = 2, involves 
the minimum amount of inspection, because the total average 
number of pieces inspected per lot is the smallest in the series. 

8.4 The Dodge-Romig Tables and the Advantages to be Derived 
from Using the Method 

In this way the first set of the Dodge-Romig Tables (SL 
tables) were constructed for certain ranges of lot sizes, lot 
tolerance, and process average defectiveness. From them such 
combinations of sample size and allowable defective number 
as will protect both the consumer and the producer, and 
reduce inspection labour to a minimum, can be read off at a 
glance for various lot sizes, lot tolerances, and process averages 
(see paragraph 8.5). 

A second set of tables (DL) arranged in a completely 
analogous way, gives the proper sample size and allowable 
defective numbers if the lot is given a second chance by taking 
a resample. An illustration of that procedure is given in para- 
graph 8.7. Specimens of these two kinds of tables are given in 
Appendix IXa, b. 

The result of applying the Dodge-Romig method is that 
certain lots will be accepted on the strength of samples. Others 
will be rejected and 100 per cent, inspected. The first type of 
lot will naturally contain a certain amount of defectives, as only 
the defectives in the satisfactory samples will be removed. The 
second type of lot, however, will be completely cleared of 
defectives. It follows that the total, consisting of both types of 
lots, tvill be reduced in defectiveness compared with the total 
before inspection. 

The Dodge-Romig method also enables us to predict the 
defectiveness in the outgoing total — ^that is, after inspection— 
and to adjust our sampling scheme so as to ensure that a stated 
amount of defectiveness wiU not be exceeded. In this respect, 
the method acts like an adjustable sieve for separating defective 
from good work. 
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Two parallel sets of tables, one for single sampling (SA), 
the other for double sampling (DA) allow us to read n and cfor a 
given limit of defectiveness in the outgoing total (lots accepted by 
sample plus those which have been cleared of defects). (See 
conclusions to Tables 8.2, 8.4.) The SA tables are giVen in 
condensed form in Appendix IXc. 

We thus find three prominent characteristics of the Dodge- 
Romig method ; 

(1) It works with sample sizes and allowable defect numbers 
which are calculated so as to protect both consumer and 
producer. 

(2) It considerably reduces inspection labour. 

(3) It enables us to set an upper limit of defectiveness after 
inspection — that is, in the outgoing total. 

8-5 Illustration of the Scheme for Single Sampling— Demonstration 
of the Advantages to be Derived from It 

Let us now check the correctness of the claims put forward 
by the Dodge-Romig method as regards effectiveness in 

(1) Protecting the consumer ; 

(2) Reducing inspection labour ; 

(3) Limiting the defectiveness in the outgoing total. 

This will also serve as an illustration of how the method is 
applied to actual conditions. 

[a] For the purpose in view it is necessary to be in a position 
to compare the defectiveness of the product before inspection 
with that after inspection, and so to check the prediction made 
by theory. The Universe (this is the technical term for the 
total of the product) must therefore be completely known. 

It is further desirable, in order to reproduce actual con- 
ditions of sampling as closely as possible, that human bias in 
dividing the universe into lots of varying degree of defectiveness 
should be excluded. That is, the number and distribution of 
defectives in a lot should be truly random. The way in which 
these conditions were fulfilled in the present experiment will be 
described in paragraph 8.6. 

{b) Let there be given 25 boxes or linen bags containing 
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each. 400 wooden or steel balls, icprcsciiting a kind of industrial 
article, of which the percentage of defectives as shown in 
Table 8.2 is marked, say, red. 

Supposing now that it was desirable to eliminate the defec- 
tiveness in the total formed by those lots. We could, without 
using statistical methods, do nothing but inspect all the 10,000 
pieces individually and clear the lots of defective work. If we 
do that on a gauging machine we know that a certain per- 
centage of wrongly classified articles is inevitable. If the 
examination is done by human examiners, errors are still 
more likely to creep in, especially if the inspection is visual and 
the examiners differ in their judgment of what constitutes an 
error. A certain small percentage of defects in the outgoing 
product is therefore inevitable, even with 100 per cent, 
inspection. 

Realizing this, it would seem only sensible to accept before- 
hand a proper limit of defectiveness which can be made as small 
as desired, because by doing so we comply with one of the 
essential conditions for using such combinations of sample 
sizes and allowable defect number as will reduce inspection 
labour to a minimum. This will be seen from the conclusions 
to Tables 8.2 to 8.6, pages 203, 209, 213, 

If, however, not complete elimination but only reduction 
of defectiveness* was required, then it is evident that 100 per 
cent, inspection must result in waste of time and money. 

(c) For the purpose of determining the quantities necessary 
for using the Dodge-Romig Tables a number of “ lots ” were 
inspected for defectiveness. It appeared at an average to be 
somewhat greater than B per cent. This gives a process 
average (p) of -OB. As only a small number of lots were 
sampled for that purpose, the estimate must be regarded as 
tentative. 

It was further assumed that the consumer wanted to exclude 
definitely the possibility of accepting individual lots with more 
than lO per cent, defectiveness. This gives a lot tolerance 
defectiveness (pj of 10 per cent. Strictly speaking, the system 
does not protect the consumer in this respect absolutely, but 
only with 90 per cent, certainty. 

The percentages of p and pi arc higher than those which 
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Table 8,2 * 


1 

Lot No. 

No. of defectives 

Total No. of defectives 

0/ 

/o ^ 

defective- 

ness 

In 

sample 

In rent 

Accepted 

ainder 

lOOO/o 

inspected 

In lots 
accepted 
by sample 

In lots 

1007 o 

inspected 

I 

,1 

!) 


12 


3-0 

11 

1 

6 ' 


7 


1-8 

in 




11 


2.8 

t(IV) 



(11) 


(18) 

(4-6) 

V 




12 


3-0 

VI 




n 


2-8 

VII 




7 


1-8 

VIII 




9 


2.2 

IX 




17 


4-2 

X 

3 

11 


H 


3-5 

XI 

G 

10 


10 


i-0 

XII 

4 

20 


24 


6.0 

XIII 

0 

17 


23 


5-8 

t(XIV) 

(0) 


(19) 


(28) 

(7.0) 

XV 

4 

7 


11 


2-8 

XVI 

6 

16 


22 


6-6 

XVII 

2 

25 


27 


6-8 

t (XVIII) 

(U) 


(10) 


(21) 

(6-2) 

XIX 

4 

14 


IS 


4-5 

t(XX) 

(11) 


(16) 


(27) 

6-8 

t(XXI) 

(21) 


(65) 



(19) 

t(XXII) 

(18) 


(48) 


(66) 

(14) 

t(xxill) 

(13) 


(53) 



(14) 

t(XXIV) 

(11) 


(36) 



(11-76) 

t(XXV) 

(13) 


(38) 


(51) 

(12.76) 


169 186 286 241 400 

' ’ ' ' 

041 641 

Of this total 56 occur in 
not bracketed samples 


* The number in front of the first vertical dividing line gives the number 
of defectives found in the sample, and that after the line gives the remaining 
number of defects in the lot. It goes without saying that in practical work the 
total defectiveness would be ascertained only in those lots which do not pass 
the sampling criterion. 

t Lots IV. XIV, XVIII, XX, XXI-XXV to be 100 per cent, inspected 
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Results 

(1) Total No. of articles IQ'OOO 

Defects in samples of accepted lots 55 

Defects in 100% inspected lots 400 

^ 

Outgoing Total !l-r)45 

Total number of defects in lots accepted by sample (after re- 
moving the defects found in those samples! 186 

Defectiveness of total after inspection, including lots aecepled by 
sample and those which were 100 per cent, inspected and cleared 
of defects L = 1.95 % 

(2) Total number of pieces inspected : 9 Lots x 400 s= 3,000 

IG Lots X 100 = 1,600 

5,200 

That is, 52 per cent, of 10,000 
Au erage number insjiecleti per lot = 208 approximately 


would be acceptable in practical work, because our lots showed 
a high defectiveness, according to what was chosen as con- 
stituting a defect. 

The combination of the lot size N = 400, p = 5 per cent, 
approximately, and j!;, = 10 per cent, allows us to calculate or 
read from the Dodge-Romig Tables a combination of sample 
size and allowable defect number, wltich will protect both 
consumer and producer and will moreover result in an appreci- 
able reduction of defectiveness. 

We read for those conditions from Table SL.IO * a sample 
size n = 100, and allowable number of defects c = 6, and a 
maximum defectiveness in the outgoing total of 2-9 per cent. 
We now sample lots I-XXV according to that combination 
of n and c by drawing samples of 100 from each lot and 
recording the number of defectives. A lot with more than 6 
defectives in a sample of 100 must be lOO per cent, inspected 
and cleared of defectives. The result is set out in Table 8.2 
opposite. 

(d) Conclusions 

We see from the above table that : 

(1) None of the accepted lots was Worse than 10 per cent, 

defective. 


♦For specimen table see Appendix IXa 
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(2) None of the lots which were more than 10 per cent, defective 
passed the criterion. 

These two results show the efficiency of the sampling- 
criterion, that is, the combination of sample sizes and allowable 
number of defects, in making the results comply with what we 
wanted at the outset. 

(3) The number of inspected individual pieces is about half 
the total number of articles — 6,200 instead of 10,000. This 
shows a great reduction in inspection labour as the result 
of applying a rational method of sampling. 

(4) In spite of having thus reduced inspection labour by 
approximately one half, the fraction defectiveness of the 
product after inspection is less than 2 per cent., as against 

641 

the defectiveness before inspection of 6.4 per cent. 

J w j w W V/ 

It is also less than the acceptable limit of defectiveness 
after inspection, as given by the Dodge-Romig Table under 
the given conditions, viz. 2.9 per cent. 

These results show that the predictions made by theory 
and the advantages promised by it are fully borne out by the 
experiment. Inspection labour and defectiveness of the product 
are both considerably reduced, and the defectiveness in the 
total after inspection is less than the predicted limit. 

8.6 The Sampling Experiment on Tables of Random Sampling 
Numbers which Provided the Data for the Foregoing and the 
Following Illustrations 

The story told in II is the translation into physical terms 
of what happened when 26 Tables of Random Numbers were 
sampled for four-figure sets containing zeros. 

The reason for first constructing a kind of frame or skeleton 
for the actual experiment is indicated in paragraph 8.5 (a). Let 
us be a little more explicit about it. 

Any sampling scheme is based upon the assumption that 
defectives will be randomly distributed in the total. In order, 
therefore, to reproduce actual conditions, we have to mix 
thoroughly the 400 balls which form a lot. ’ ’ 

Now practical experiment has demonstrated that it is 
impossible to mix the balls or other objects so as to make their 
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arrang’cmcnl completely random. In whatever way we may 
attempt to achieve the result of a complete random distribution, 
human bias creeps in and disturbs the randomness apart from 
the disturbing factors due to lack of homogeneity among the 
objects. In short, balls or tickets or other objects fail in 
large-scale random tests ; it is as difficult to get artificially 
true random samples as it is to .sample effectively a cargo of 
coal or barley (see Foreword to Tippett’s Random Kumher 
Tables). 

In our case the difficulty is increased by the experimental 
design according to which some of the lots are not signifi- 
cantly different in defectiveness. In such a case, the 
arbitrary dillercntiation between lots, by simply allocating 
different numbers of defectives to them, would be very unsatis- 
factory. 

To overcome difficulties of that kind, K. Pearson suggested' 
to L. H. G. Tippett that he should replace the whole system 
of tickets or balls by a single random system of numbers ranging 
from 0-10,000, In order to form such a table of random 
numbers, 40,000 digits were taken at random from census 
reports and combined by fours to give 10,000 numbers, which 
were arranged in twenty-five tables. Here is a specimen from 
one of Tippett’s tables : 


1690 

3009 

7000 

6066 

6298 

6090 

8680 

6750 

^001 

6993 

6243 

6375 

5088 

2078 

8339 

0322 

2876 

0399 

0741 

2579 

7652 

4J71 

1364 

9390 

2670 

7909 

2909 

8705 

0188 

9808 

7200 

8104 

6808 

4329 

0081 

9862 

9468 

0863 

8491 

1166 

820S 

1126 

9153 

9517 

1481 

0870 

2365 

2616 

3117 

0239 

0770 

8034 

0978 

9120 

0967 

0334 

S-SSS 

1979 

1120 

3109 

1109 

0151 

3869 

3360 

0039 

4461 

8674 

6296 

8111 

6748 

7451 

2141 

2t00 

4971 

0464 

1849 . 

7065 

3093 

6860 

7777 


As a universe complying with strict experimental conditions 
of randomness, the 10,000 four-figure sets known as Tippett’s 
Random Numbers (which form Tippett’s Tables I-XXV, 
in No. XV of the “ Tracts for Goniputors,” Cambridge 
University Press) were therefore chosen. Each of the four- 
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figure sets was taken to represent one observation and the 
occurrence of a zero in such a set as denoting a defective. A 
lot was taken to be formed by the 400 sets of four figures com- 
prised in one table. This gave 26 lots. ^ 

In order to obtain a fairly wide range of variation of the 
lot fraction defectiveness, the number and position of zeros 
making a set of four digits a defective was varied. 

For Tables I-X a defect was defined as the occurrence of 
two adjacent zeros in one four-figure set. 

For Tables XI-XX a defect was defined as two zeros in 
any position, including the case of adjacent zeros. 

For Tables XXI-XXV a defect was defined as one zero, 
not occurring either as die first or the last figure, excluding thus 
all sets with more than one zero and with zero at the extreme 
places. 

From the lots I-X, XI-XX, and XXI-XXV a small 
number was selected and the number of defectives ascertained. 
From the results, which were weighted according to the 
number of tables of one and the same type of defectives, 
the fraction defectiveness representing the process average 
defectiveness (p in the Dodge-Romig’s terminology) was 
determined. It resulted as somewhat over 6 per cent. 

Putting the defectiveness which a consumer was willing to 
tolerate in an occasional single lot (but by no means in the 
grand total) as 10 per cent., we arrive by the D.R. Tables for 
lots of 400 specimens at a sample size of 100, and 6 as the 
allowable number of defectives, and this formed the sampling 
criterion for the experiment described in paragraph 8.6. 

8.7 The Double Sampling Scheme Illustrated 

(a) A modification of the Dodge-Romig system as exjDlained 
above is the Double Sampling System. In double sampling 
the lot is given a second chance of acceptance if the first sample 
results are unfavourable. Other things being equal, the 
average amount of inspection for double sampling is less than 
for single sampling. On the other hand, the defectiveness in 
the total after inspection will be slightly greater. 

The procedure is set out in the following table : 
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Table 8.3 


Inspect a first sample 
of pieces 


If the number of defectives found 
in the first sample 

I 


does not exceeds c, but does exceeds 



For our process average of slightly more tlran 5 per cent., 
a lot tolerance of 10 per cent., and lot size of 400, we find 
from Table D.L. 10.* 

Ill =60 Cl = 2 

Hz = 90 

Til and = 160 c^ = 10 

Upper limit of defectiveness in outgoing total : 3'2 per cent. 

For a sample of 60, the first 60 sets of four figures of each 
table were chosen, for the additional sample of 90 the following 
90 four-figure sets. The results are set out in Table 8,4, which 
is arranged like Table 8.2. 

* For specimen table see Appendix IXb 
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Taulc 8,4 


Lot 

No, 

No. of defectives 

lotal No. of 
defectives 

Defective- 

ness 

In sample 

In remainder 

In lots 
ac- 
cepted 
by 

sample 

In lots 
100% 
in- 
spected 

Lt 

2ud 

1st + 
2nd 

Ac- 

cepted 

10 

7 

4 

n 

11 

n 

7 

8 

8 

12 

100% 

in- 

spected 

I 

II 
HI 
IV 

V 

VI 

VII 

VIII 

IX 

X 

.t 

1 

1 

4 

2 

4 

4 

5 

2 

7 

7. 

1 

1 

9 

2 


12 

7 

11 

18 

12 

11 

7 

9 

17 

14 


3 0 

1-8 

2-8 

4'5 

3-0 

2 8 

1-8 

3-2 

4 2 

3'5 

XI 

4 

3 

7 

!) 


10 


4'0 

XII 

I 


1 

23 


24 


6'0 

XIII 

0 


2 

21 


23 


5-8 

*(XIV) 

(S) 

(0) 

(11) 


(17) 


(28) 

(7.0) 

XV 

3 

1 

4 

7 


11 


2-8 

XVI 

5 

4 

9 

13 


22 


5.5 

XVII 

I 


1 

26 


27 


0-8 

■'‘fXVIII) 

(fi) 

(S) 

(11) 


(10) 


(21) 

(6.2) 

XIX 

4 

3 

7 

11 


18 


4-5 

*(XX1 

(B) 

(7) 

(13) 


(14) 


(27) 

(6-8) 

•'(XXI) 

(10) 

(19) 

(■>9) 


(47) 


(76) 

(19) 

*(XXII) 

(12) 


(12) 


(54) 


(66) 

(14) 

* (XXIII) 

(1) 

(10) 

(17) 


(49) 


(66) 

(14) 

*(XXIV) 

(8) 

(7) 

(15) 


(32) 


(47) 

(11-75) 

♦(XXV) 

(7) 

(14) 

(21) 


(30) 


(SI) 

(12-75) 


189 199 253 259 383 

/" 1 ^ '■ V ' 

/ 641 641 

Of this total 60 occur in 
not bracketed samples 


* Lots XIV, XVIII. XX. XXI-XXV were 100 per cent, inspected 
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Rr siiLTs 

(1) Total number of artide-'! 10 000 

DdVc tivi"! ill sainplrs of accepted Ims ’ (iO 

Defectives in 1 00 per cent, inspected lots 382 


Otitgoing Total 

Total number ol defects in lots accepted by sample — that is, 

after rmnoving the defects fotmd in the samples .... 19a 

Defectiveness of total after inspection, including both lot.s 

accepted by sample tuid .such as were 100 per cent, inspected and 

cleared of defectives = 2 - 08 % 

(2) Total nunibei of pieces inspected 

10 Lots X RO (only 1st sample) = OOO 

7 Lots X l.iO (1st and 2nd samples) — 1,050 

S Lots X *100 (100 per cent.) = 3,200 


‘1,830 

That is, iR‘5 per cent, of 10,000 
Average number of [)ieces inspected pet lot = 19-1 

(/;) Conclusions. 

From results (1) find (2) in the above table we see that 
double sampling works in principle like single sampling. It 
will materially reduce inspection labour and safeguard against 
the defectiveness in the outgoing total product exceeding a 
stated acceptable amount. 

Furthermore, comparing results (1) and (2) of the above 
tables, wc find that the predictions made by theory about the 
difference between single and double sampling with regard to 
the number of inspected pieces, and the defectiveness after 
inspection (see section (a) of this paragraph) are borne out by 
the experiment : 


Sampling 

method 

Number of 
inspected 
piece.? 

Average defectiveness in 
total after inspection 

Found 

Limit 

s.s. 

D.S. 

5,200 

1,850 

1-05% 

208% 

2- 9% 

3- 2% 




210 qUALITY CONTROL 

We need not be deterred by these high percentages of 
defectiveness in the outgoing product- They are the con- 
sequences of the high defectiveness in the product before 
inspection. For a product of moderate defectiveness of, say, 
up to 2-3 per cent, we can get for lots of 400 a limit fh the 
outgoing product of between -82 per cent, and IT per cent, 
only, and the actual defectiveness after inspection will, in 
most cases, be well below these limits. The result would thus 
not be materially different from what we could reasonably 
hope to get by 100 per cent, inspection. 

8.8 Modification of the Original Dodge-Romig System if the Article 
is Inspected for a Number of Defects Simultaneously 

(fl) Sometimes inspection is extended to a group of char- 
acteristics instead of to one characteristic only. Provided that 
the defects with respect to these characteristics are of the same 
seriousness, and may be considered additive, the Dodge- 
Romig method — with some modifications — can be applied to 
this kind of inspection. 

This extension is envisaged by Dodge-Romig on page 14 
of their article in Volume 20 of the Bell System Technical 
Journal, and so far they are right. They do not seem, however, 
to appreciate the changes that become necessary with regard 
to what is now to be considered the “ lot ” and the “ sample.” 
The lot size, in this case, is no longer given by the number 
of pieces, neither is the sample size. 

Supposing we were inspecting for four types of defects. 
Then a total of, say, 10,000 pieces, represents 4 x 10,000 
opportunities for defects, either occurring or not occurring. 
More precisely, our universe consists no longer of 10,000 pieces 
as before, but of 40,000 loci — ^that i.s, potential places for 
defects, or freedom from such. We can also think of so many 
holes, into which the defects fit like pegs. A hole without a 
peg is then considered as not defective. A “ lot ” is now a lot 
of such loci, A lot of, say, 400 pieces would represent a lot 
of 4 X 400 = 1600 observations. This is the lot size with which 
to enter the Dodge-Romig Tables. Similarly, the sample size 
n means n loci. As we cannot work with a fixed number of loci, 
but only with a fixed number of specimens, we must necessarily 
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pick a sample of pieces whose size is olDtainecl by dividiag the 
number given by tire Dodge-Romig Tables by the number 
of types of defects, four in our case. The letter c is now the 
number of defects, not “ defectives,” which is allowed to occur 
in the sample of n divided by 4. For our lots of 400 pieces, 
that is, ICOO individual potential places for defects, we now read 

n = 195 c = 14 

that is, we have to pick samples of n divided by 4, that is, 

— 48-76. We thus pick a sample of 49 and count the number 
of all the four types of defects occurring in them, passing up 
to 14. 

[b) Turning to our experimental frame, we have to modify 
our criterion for defectiveness. We regard a set of four figures 
as one piece and let the occurrence of 1 mark the article 
as defective. According to the place in which a 1 occurs, a 
different type of defectiveness is indicated. We may thus 
get sets of four, witlr 0, 1, 2, 3, or 4 defectives in them. 

As the percentage of four-figure sets with 1 in them is con- 
siderable, it was thought fit to reduce the defectiveness in some 
of the lots by regarding certain rows of figures as being free 
of defects, which served at the same time as a means for varying 
the lot defectiveness. 

Thus, for Lots I-X, defects were not counted in rows 1, 2, 
and 3, that is, these rows were considered as not containing 
any defects. 

For Lots XI-XV defects were not counted in rows, 1, 2, 3, 
and 4. 

Lots XVI-XX defects were not counted in rows 1, 2, 3, 4, 
and 5. 

In Lots XXI-XXV defects were counted in all rows. 

The results are exlhbited in the following Table 8.5. 
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QUALITY CONTROI. 
Tadlc 8.5 



No- of defects 

Total No. of defects 

6 

Lot No. 


In remainder 

In lots 

In lots 


In sample 


100% 

inspected 

accepted 

100% 



Accepted 

by sample 

inspected 

(11 

(17) 


118 


185 

(II) 

(16) 


92 


108 

(III) 

(19) 


111 


130 

IV 

11 

97 


lOR 


V 

12 

97 


109 


VI 

U 

94 


108 


(VII) 

(17) 


92 


109 

vin 

9 

98 


107 


IX 

8 

111 


119 


X 

14 

87 


101 


(XI 1 

(15) 


87 


102 

XII 

12 

73 


85 


XIII 

10 

77 


87 


(XIV) 

(17) 


75 


02 

XV 

7 

98 


105 


XVI 

12 

64 


76 


XVII 

13 

90 


103 


XVIII 

10 

71 


81 


XIX 

0 

71 


80 


XX 

11 

06 


77 


(XXI) 

(17) 


117 


134 

XXII 

8 

151 


159 


(XXIII) 

(25) 


128 


153 

(XXIV) 

(19) 


162 


171 

(XXV) 

(22) 


136 


158 


344 1345 1108 1505 1292 

/' ' ' V- ' 

/ 2797 2797 

Of this total 160 occur in 
not bracketed samples 
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Results 

(1) Total number uf occasions for defects .... 40,000 

Defects in samples of accepted lots 160 

Defects in lUU per cent, inspected lots .... 1,2!)2 


Outgoing Total 

Total number of defects after inspection . . . . 1,345 

Defectiveness of total ~ 3-0% 

Defectives before inspection = 7% approx. 

(2) Total number of inspection incidents : 

10 Lots 100% 10 X 1,600 = 16,000 

15 Lots by sample 15 x 195 = 2,925 


la,l )25 

Total number of pieces inspected ■= 18,925 ; 4 = 4,,731 

That is, little more than 47 pdl cent, of the 10,000 articles contained in the 

15 lots. 

Average mimber of pieces inspected per lot ~ 190 approximately. 

(c) From results (1) and (2) we see that the predictions made 
by the theory are again borne out by experiment. 

(a) Number of pieces inspected is less than half the total number 
—4,731. 

(i) The defectiveness in the outgoing total is 3-5 per cent., 
against the defectiveness before inspection of nearly 7 per 
cent., and is less than the limit of defectiveness given in the 
Dodgc-Romlg Tables under the given conditions, namely, 
4-4 per cent. 

It appears, therefore, that the method is equally suitable 
in the case of simultaneous inspection for a number of defects, 
provided the necessary modifications are made. 

8.9 The Shorthand Method of Inspection 

The rational sampling scheme enables us to determine 
the defectiveness in the outgoing total with great accuracy 
by choosing such a combination of n and c as will under the 
given conditions reduce the existing defectiveness below a 
stated limit. 

The consequence is that we know the quality of the out- 
going total just as well, if not better, than if we had tested 
everything. We have been spared that trouble and have had 
to test only a fraction, say less than 50 per cent, in our examples. 
Our knowledge of the quality of the product over a long 
(012) 15 
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period of time has thus been achieved with a considerable 
saving cf labour. The principle of reconstructing the picture 
of the total from the disconnected data provided by our sample, 
according to a rational procedure, reminds one forcibly oj that 
involved in shorthand. There we reconstruct the longhand 
text from the disconnected shorthand notes, by recognized 
rules. 

In a certain sense, all quality control by statistical methods 
can be conceived as resting upon the same principle. Our 
aim is always the determination of the quality of the total 
product from that found in disconnected samples, and the 
procedure which is to be observed if the estimate is to be reliable 
is based upon what we know about the statistical relationship 
between lot and sample quality. 

In the light of this analogue it becomes clear what an 
arbitrary sampling criterion — say 100 with no defectives among 
them — or any arbitrary method of judging lot by sample 
quality, means. It is as if every shorthand typist were to rely 
on Ills or her ingenuity in abbreviating or condensing the 
longhand text, and, consequently also on his or her intuition 
when it comes to reconstructing that text. Such procedure can 
have satisfactory results only in very exceptional cases, and 
it would therefore seem good sense to adhere to the recognized 
method there, and to the rational method in our case of 
industrial control of quality. 

But quality control does more than merely determine the 
quality of the product, in the sense of registering its changes 
during production or at inspection. In using statistical 
methods, we actively determine the quality by aiming at a 
stable process within specifications— during the process of 
manufacture — ^and by aiming at a stable quality of the outgoing 
product compatible with the requirements of both consumer 
and producer — ^in the inspection stage. This constitutes the 
characteristic fetiture of what we may call “ Dynamic Statistics.” 
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APPENDIX I 


„ Area under the Normal Curve in Terms of 
Deviations from the Mean 

This table gives the PTobabilily P, of an Observation 
falling outside the Limits ± - , wheie x is the Deviation from 
the Mean and a the Standard Deviation. 

For the Cumulative (“ Ogive ”) Curve, the Proportion of 
the whole area of the Normal Curve lying to the right of -j- 

P r P 

is given by jj, and that to the left of - by 1 — s* 

° 2 G A 


xjo 

P 

■v/ct 

P 

O'O 

1-000 

2-1 

•036 

01 

•920 

2-2 

•028 

0'2 

• S 42 

2-3 

•022 

O ' 3 

•784 

2-4 

■OIC 

0'4 

•089 

2-5 

■012 

Q'G 

•017 

2-6 

■009 

O'O 

•649 

2-7 

■007 

0-7 

•484 

2-8 

•005 

0-8 

•424 

2-9 

•004 

0'9 

•388 

3-0 

•003 

1-0 

•317 

3-1 ■ 

-002 

M 

•271 

3-2 

■0014 

1'2 

•230 

3-3 

■001 

1-3 

•194 

3-4 

•0007 

1'4 

•102 

3-5 

• -0006 

l-S 

•m 

. 3-6 

■0003 

t -6 

.110 

3-7 

•0002 

1'7 

•089 

3-8 

•00014 

I'S 

•072 

39 

•0001 

1-9 

•057 

40 

• 00 Q 06 

20 

•04 G 
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Instructions for Calculating the Mean and Standard 

Deviation 

(1) The most natural step in reducing a series of measure- 
ments to a more compact form is to find their average, i.e. to 
add the observations and divide by the number of items. This 
provides a single figure characterizing the whole group. 

It will be found useful to express operations like this in 
condensed form ; it makes the operation easier to remember, and 
by adhering to the same symbols for identical quantities it helps 
to make the representation of the matter more consistent. 

Let us call each observation X, and let N be the number of 
such observations. Let tlie operation of summing the observa- 
tions be denoted by S, and let M„ stand for the Average of the 

sx 

X’s, then Mx — jf. 

The information given by the average, useful as it is, as 
regards the central tendency of the series, leaves us still in the 
dark about its tendency towards dispersion or spread. In 
order to complete our information about the distribution it is 
therefore desirable to have a single figure which expresses the 
scatter or spread, just as the arithmetic mean expresses its 
average or level. The most important measure of this kind is 
the Standard Deviation^ 

Before writing down its formula we must have some way of 
writing any deviation. 

Using X and in tlie same sense as before, let the differ- 
ence between observation and average be denoted by a;, thus 

X - M, = a; 

X' - = x' 

X" ~Ux-= x" 

The x’s then represent the deviations of our X’s from M». 
Summing all the X’s -vVe have 

Sx = S(X - M,) = SX - SM* == SX - NM,. 

^ ; .'. NM* - SX 
Sx = SX - SX = 0. 


But 

Hence, 
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This is one of the fundamenlal properties of the average : 
the algebraic sum of the deviation from it adds up to zero. This 
means that we cannot use the deviations as they are for measur- 
ing the dispersion. But the arithmetic mean has another very 
important property. If the deviations were taken from any 
other value of the scale the sum of their squares would always be 
greater than is the sum of the squared deviations from the mean. 
This property is now used for arriving at a measure of dis- 
persion, the standard deviation. 

It is simply the square root of the mean square deviation, 
just as the average is the mean of the measurements to the 
first power. 


In symbols 



It can, however, be more easily computed once we know the 
average by the following formula 

which works with the observations X as they are, instead of with 
the deviations X — 

If, for the sake of simplifying the arithmetic, the deviations 
of the observations are taken not from the true mean, Mp, but 
from an arbitrary value A (in this way we can avoid having to 
work with decimals), the formula for the me^n then becomes 

I Mx = K -1- b, where b ~ 

according to the formula for the average in general. 

The standard deviation becomes then 






-KY 


N 


- b\ 


See also Fig. 5.7 and the calculation exhibited in paragraph 
5.6 for Grouped Distributions, 
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Factors for Converting 


{a) the Average Standard Deviation in Samples of n (s„) 

{b) the Average Range in Samples of n (»„) 
into the Population Standard Deviation (cr) according to the 
formulae a — ^ 


" = 7r 


n 

bn 

dn 

2 

0'504 

1-128 

3 

0-721 

1-093 

4 

0-798 

2-069 

6 

0-841 

2-320 

0 

0-869 

2-634 

7 

0-888 

2-704 

8 

0-903 

2-847 

9 

0-914 

2-970 

10 

0-923 

3-078 

11 

0-930 

3-173 

12 

0-936 

3-258 


(From Table 11, Davies and Pearson, Methods of Estimating from Samples the 
Papulation Standard Deviation, J. R. St. S., vol. i, No. 1, 1934) 
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APPENDIX IV 

Control Chart Limits for Averages 

Factors by which to multiply 

[a) The Average Standard Deviation in Samples of n (s„) 

[b] The Average Range in Samples of n (»„) 

in order to obtain the i distances of the 1/40 and I /1 000 
Limits rcspectivity from the Grand Average X. 


The Factors were calculated according to the formulae : 


1 /40 Limits 

1/1000 Limits . 

1-96 

bnVn 

3-09 
by, Vn 

1-96 
dn Vn 

3-09 

d,,Vn 


Inner limits 

Outer limits 

Inner limits Outer limits 

n 

Ao 025 

Ao-ooi 

A'l) 0-S5 

A'o-ool 

2 

24G7 

3-874 

1-229 

1-937 

3 

1-503 

2-464 

0-608 

1-061 

i 

1-228 

1-936 

0-470 

0-760 

6 

1-0*12 

1-643 

0-377 

0*594 

6 

0-921 

1-462 

0-316 

0-498 

7 

0-834 

1-316 

0-271 

0-432 

8 

0-707 

1-210 

0-244 

0-384 

9 

0-714 

1-127 

0-220 

0-347 

10 

0-072 

1-068 

0-202 

0-317 

11 

0-630 

1-002 

0-180 

0-294 

12 

0-605 

0-953 

0-174 

0-274 


aat 
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APPENDIX IV 


Control Chart Limits for Ranges 

Factors by which to multiply 
(fl) the Average Standard Deviation in Samples of n (sn) 

(b) the Average Range in Samples of n (a;„) 

in order to obtain the Inner and Outer Control Limits for the 
Range in such Samples . These factors were calculated accord- 
ing to the formulae 


* Range at required Prob. Level as 
Multiple of a 

Range at required Prob. Level 
as Multiple of 

bn 

dn 



Limits 



Limits 



Lower 

Upper 

Lower 

Outer 


Inner 

Outer 

Inner 

Outer 

Inner 

Outer 

Inner 

Outer 

n 

D'oss 

D-oo 1 

D-0 2 6 

D 00 1 

D'-0 2 5 

D'-ooi 

D'-o2S 

D'-ooi 

2 

■a 


5-018 


■030 


2-810 

4-121 

3 

■415 

•083 

6-086 

6-994 

•177 

•036 

2-174 



•739 

•251 

4-9S7 


•287 


1-933 

2-673 

S 

LOU 


4-995 



■169 


2-366 


1-220 

■622 


6-470 

•418 

•213 

1-720 

2-218 

7 

1-407 

-777 

6055 

6-452 

•462 

•266 

1-660 

2-120 

8 





•495 

■292 

1-620 


9 

■m 


6-143 


•622 

•323 

1-683 

1-986 

10 

mm 

1-170 

5193 


•643 

■351 

1-666 

1-940 

11 

1-914 


6-226 

0-494 

•651 

•378 

1-632 

1-904 

12 

2-008 

1-388 

6-265 

6-505 

■677 

•399 

1-610 

1-870 


It should be noted that the Factors A and D in the above 
Tables are multipliers for the Average Standard Deviation in 
Samples of n (s„), whereas the A and D factors of Table 10 and 
13 of British Standard Institution, 600 R: 194:2 are multipliers 
for the Population Standard Deviation (cr) . 

The values given for the Factors “ D' ” differ slightly from 
those in Table 13a British Standard Institution, 600 R; 1942. 
They had to be recalculated as those given in 600 R are slightly 
in error. 

* . . . The immerator in the formula is taken from Pearson and Hartley, 
“ The Prob. Integral of the Range . . Biornelrika, vol. xxicii, 1942 
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for the probability P that an event occurs at least c ti 


Number of defectives in samples of n units or number of 
defects in unit 


APPENDIX VI 


The and Limits for Number Defective Ch:krts 

(suitable if bulk defectiveness is less than 10 per cent.) 



Average Number of Defectives per Sample of n Units or Average Number of 
Defects per Unit 


m 



Fraction Defective or Q 


APPENDIX VII 

The three charts of this Appendix are after L. E, Simon, 
An Engineer's Manual of StcUistkal Method (1941) 


Chart 0-5 = I^ 



m 



Fraction Defective or Q 
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APPENDIX VII 


Chart 0*9 = 1 ^ 



Fraction Defective or Q 
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Chart 0*1 = I, 



Sample size or n 




APPENDIX Vni 


Table 1 

For a given combination of Sample Size and Number of 
DefectiveSj read with 50 per cent, probability the Lot Quality 
as percentage defectiveness which could have given rise to such 
Sample Quality : — 


Sample 

size 

No. or\. 
Defectives X 

no 

100 

200 

300 

400 

000 

0 

It 

•7 

-35 

•2.1 

•18 

•14 

1 

3'3 

1-7 

•84 

•58 

•42 

•33 

2 

ri'2 

2-7 

1-3 

•9 

■07 

•53 

3 

7'3 

S-6 

1-8 

I '2 

•90 

•73 

4 

9-0 

4-6 

2-3 

1'6 

n2 

•93 

0 

lit) 

5-7 

2-8 

I'D 

1-4 

M 

0 

13-0 

6'7 

3-3 

2-2 

1'7 

ns 

7 

10-0 

7-8 

3-8 

2'0 

1-9 

ns 

8 

16'8 

8.G 

43 

2'8 

2^2 

n7 

9 

19-0 

9.8 

4'8 

30 

2’4 

1-9 

10 

2D0 

10-5 

6-3 

3'5 

2-7 

2^1 
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Table 2 

For a given combination of Sample Size and Number of 
Defectives, read with 80 per cent, probability the Lot. Qjaality 
as percentage defectiveness which could have given rise to such 
a Sample Qiiality: — 


\Sample 
X^size 
No. \ 
of \ 

Defectives 

50 

100 

200 

300 

400 

600 

0 

•2- 4-5 

■] - 2.3 

•05-1 '2 

•036- -78 

■029- -68 

•Oil- -48 

1 

1-8- 7'« 

•5 - 3-8 

.26-2-0 

•17-1-3 

•14-1'0 

•10- -78 

2 

2'2-10-2 

M - 5-2 

•gS-2-6 

•38-1-7 

•28-1 .3 

•22-1 -1 

3 

3'5-12-9 

D8 - 8’5 

•88-3-3 

•64-2'2 

•46-1 -7 

•30-P3 

4 

S'0-16'3 

2'5 -- 7.8 

1-2 -4-0 

•80-2'7 

•C3-2'0 

•50-1-6 

6 

6-4-18’0 

3'] - 9 0 

l-68-4'6 

1-05-3-1 

•80-2-3 

•63-1 ’8 

fi 

8'0-20-0 

4-0 -lO'S 

1-96-5-3 

1-3 -3 6 

1-0 -2-B 

•78-2'l 

7 

0'r)-22'0 

4'7 -1P5 

2-36-5-7 

l-56-i3'9 

1-2 -2-9 

•93-2'3 

8 

l]-0~24-8 

S‘G -12-0 

2-7 -6-4 

1-8 -4 3 

1-4 -3-2 

M -2'6 

9 

12'9-27-0 

134 -14-0 

3-1 -70 

2-1 -4-7 

1-6 -3-6 

1-3 -2-8 

10 

14'4-29'0 

7-1 -15-0 

3-6 -7-6 

2-35-60 

1-8 -3-8 

1’4 -31 




230 


appendix vin 


Table 3 

For a given combination of Sample Size and Lot Quality 
in percentage defectiveness, read with 80 per cent. Probability 
the Range of the Number of Defectives such Samples might be 
expected to contain : — 


\Saniplt; 

Size 

Lot 

quality'^ 

.■iO 

100 

200 

300 

400 

500 

■5 

(1-1 

0 1 

0- 2 

(1- 3 

1- 3 

1- 4 

no 

0-1 

0- 2 

1- 3 

1- 5 

2- 0 

3- 7 

2'0 

0-2 

1- 3 

2- C 

3- 0 

6-11 

0-14 

3'() 

0-3 

1 - r > 

3- fl 

(i-12 

8-16 

10-10 

4'0 

1-3 

2- 0 

5-11 

8-1 It 

11-21 

15-25 

fl'O 

1-4 

3- 7 

7-14 

11-20 

15-25 

19-30 

G-f) 

2-5 

4- 0 

8-10 

13-23 

18-30 

23-36 

7'() 

2-0 

4-10 

10-18 

10-26 

22-34 

28-42 

S'O 

2-0 

5-11 

12-20 

18-30 

25-38 

33-47 

(M) 

.3-7 

0-12 

13-23 

21-30 

29-43 

37-53 

lO'O 

3-7 

7-13 

15-25 

24-36 

34-47 

42-58 
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Specimen of Dodoe-Romig Xables for Single Sampling 

Table SL-10 Lot Tolerance per cent. Defective = 10*0 % 
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APPENDIX ixb 



Size of First Sample ; ftn — Size of Second Sample ; entry of “ All " indicates 
that each piece in lot is to be inspected 1 
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Allowable Defect Niimber for First Sample ; » Allowable Defect Number 

for First and Second Samples Combined 
AOQL = Average Outgoing Quality Limit 





APPENDIX IXc 

Dodge-Romig, Single Sampling Lot Inspection Table — Based on Stated Values of Average 

Outgoing QuALrry Limit (Condensed) 

The body of this table gives such co-ordinated pairs of sample size, n, and allowable number of defectivj^, c, as will enable one to 
meet the stipulation that the defectiveness in the outgoing total shall not exceed a stated amount. 





CO tH lO 

1 ft 0 t'* 

CO 0 CO 

ift CO 0 

rt) r~( 

1 ft (F 


s 0 

0 ' 

Vi 



-d rH 

f>H r -4 0 | 

ei ei CO 

CO 




CC 70 C 5 

0 01 

05 ift Ift 

C 5 0 'ft 

0 d lift 

0 




-M 0 » 01 

CO CO 

rt- VO (50 

0 OJ CO 

CO 05 ei 

VO 

.0 

0 • 





. ,-H 

— ei 

cq 

0 

0 

in. 

'r 

01 W -tji 

'll *0 

50 I> CO 

0 ' 3 » 01 

CO VO 05 

aq 


A 





- ^ F-i 

^ r-H 

oi 


Q 

0 


CO CO -tt 

-t( — ( 

t'*' -f 0 

Ift >ft 0 

10 0 0 

Ift 


0 

s: 

>-1 01 

01 01 CO 

CO ■*f Ift 

50 VO CO 

00 JTH fft 

Ift 








--P fH 




«- 

0 | 01 TO 

If -ii ift 

VO I'* 0 

"■I (M H 4 

r- (N cO 

f»pf 






1— ( 

r-f »~< (-< 

r-f ei ei 

CO 


vO 









to 


Its lO 

CO t>. C 3 

0 ift tft 

0 Lft ‘ft 

Ift Ift 0 

0 



01 Cl TO 

-cf -^f CO 

t- 00 tM 

■ 5 f ‘ft 00 

(74 0 0 

ft 

-.0 

O'* 

A\ 





FH rt ^ 

ei fo 1^ 


0 

1 -Ci, 








ira 

v> 

rH el (M 

CO CO ” 5 f 

■ 5}1 1 ft Vft 

r- 00 05 

0 eft VO 

0 

A 






rH n-. 

ei 


vO 

O' 


CO vo 0 

VO r>. 00 

CG C 1 ft 

wft 0 ‘ft 

0 0 ‘ft 

‘ft 


10 

K 

Cl Ol 

CO CO rf 

rf f- 

00 0 cH 

fO c- -H 








r-l r-i el 




Vj 

t-H e-i 

01 01 CO 

CO VO 

1 -^ CO 05 

1 — H 

t'p 


•.0 






F-. sq 



C 











10 1- 0 

0 0 ift 

eft 0 0 

0 ‘ft 0 

QO Z> 

Ift 


AV 


CO (TO VO 

VO VO 00 

Ch <N 00 

FI -rf 00 

ift (CO (M 

‘ft 






(M CVl (M 

eft -xf 0 

05 

0 ' 

1 - 0 . 








0 








cs 

A 

Vi 

,<H inH 

01 01 M 

Ol CO 

1 ft 1 ft to 

r- 05 fivj 

tft 






rpj 



vO 









01 


1 ft r* CO 

0 0 1ft 

1ft 0 0 

1 ft 1 ft 10 

0 0 1 ft 

Ift 



CO CO CO 

CO « VO 

5 ft ft> IM 

Ift Ift CO 

74 05 05 

0 •• 






— t 

-t f- 

M Cft 

ft 




C ^ -1 

p-lftl 

0 ^ CO -vf 

1 ft 1 ft 

t- Q Ift 



’.0 






— ‘ i-( 

W 


O' 











Cl 4 ft 0 

0 1ft 0 

0 0 Ift 

<ft 0 0 

0 IQ 0 

0 


A\ 

ci 

CO 5 ft l> 

I-'* 04 

iM 00 CO 

(05 CO el 

cft cn o> 

ei 

0 



p-' ►'S 01 

ei {ft 

'Tf 50 (31 

VO 

A 

A 

V) 

00^ 

rH 

01 N CO 

W rH lO 

1 ft I> 05 

N 


nO 







FM 


O'' 











<M CO 0 

0 tft 1 ft 

0 C- 'ft 

‘ft ift Ift 

0 ‘ft Ift 

0 




CO CO I'' 

I'* 1 " 1 - 

01 CO CO 

CO c> 

FM CO L'F 

0 > 






.-I F-t <-P 

-H ei eo 

CO ‘(ft 

r- 



v> 

iC> 0 0 

r-f M r-* 

*4 01 Ol 

Cft rf Tf 

Ift ep rH 

50 








rH 



■'.0 










Sj 

1ft 0 0 

10 0 0 

Ift 0 0 

Jft (^3 Ift 

‘ft ‘ft 0 

0 



1 ft 5 ft CO 

<M CO 

V ’f 1ft 

Uft VD V- 

Ci Ift 

CO 


AV 




r-t 01 01 

M -(f Nf 

Ift 00 

fH 







F-T 

W 

10 

in. 








A 









I 0 

Vs 

coo 

0 rH i — 1 

FI r-i (N 

eq eft fO 

■«*< 1 ft I> 

0 


O' 







i-H 




1ft 0 0 

ift 0 0 

Ift tft 0 

1ft CO 0 

Ift Ift Ift 

s 




Ift « Cft 

CD CO tf* 

'tjf to Cft 

tft 50 t- 

0 (p F^ r- 






^ r-i 01 

(M eft cft 

Mi VO 00 

cq 









f-T 


1 0 








rS N 

bo 








5 ■?{ 

ft 







0 

^1 

Sh 

4 ; 

0 

M 



000 

<000 

000 

000 

0 

3 H 


'S 

000 

000 

000 

000 

000 

000 

000 

000 
0'' o' 0'' 

s' 

a 

•M 

n 

Cd CO 

, 10 «o t- 

cJ' 0" Co' 

'<«’ 1 ® [fF 

r-t ei up 

rH 



3 

^ Jt Jj 
0 4 i) €38 

^ J* 

1 1 ( 

Xj*j^ 

t 1 r 

^ (H r-* 

1 

S 

u 

Px 


000 

-rf la VO 

S 0 0 
t» 0 0 
SoT 

S 8 S 

eft -sjf Ift 

82 % 
i> o''o 

0 

0 

o' 








Ift 



APPENDIX X 


Va-wjes of the Variance Ratio (F) at the 6 per cent, 
(in black type), 1 per gent, (in roman type), 0-1 per 
gent, (in ilalics ), Level of the F-Distribution. 
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(This tabie is a condensed version of Appendix G2, Croxton and Gowdcr, 
Applied General Statistics, London, Pitman & Sons, Ltd.) 
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Hhis is intended to be a brief summary of instructions for 
the proper use of the important tests for identity of population 
if two or more samples are available. 

The question is always whether the differences which we 
observe between the means or standard deviations in the 
samples are compatible with the assumption that such differ- 
ences could arise by mere fluctuations of chance in samples 
coming from one and the same parent distribution. The 
answer is in the affirmative if the observed difference could 
arise by mere chance at least 6 times in 100, in the negative if 
it could arise less than 6 times in 100. 

The method consists in expressing the observed difference 
by the standard deviation in the sampling distribution, i.e. in 
the distribution of the statistical constant in question. The 
greater that ratio, the greater is the likelihood that the samples 
come from different distributions or that the difference is 
“ real.” 


I 


A — Significance of Difference between Two Means 
{It is assumed that the Patent distribution is normal, or neatly so.) 


(1) For large N 

Wc form the ratio 


Difference between Mea ns 
Standard Deviation of Means 


, in symbols 


t = where a? is the standard deviation in the popula- 

ffy/ VN 

tion of individuals and N the sample size. As a sufficiently 
close estimate of Op, we use the standard deviation in the 
samples {s). In general, i.e. for samples of different s and n, 
we have, remembering that the variance of a compound 
distribution equals the sum of the partial variances, 


t = 




=J=, which formula reduces to t 


+ 


•A 


2 

N 


Nx ' K 

if the samples have sensibly the same s and the same N. 

If now the quantity t exceeds 2 or 3, this means that a 
difference as observed, eitlter in the positive or negative 
direction, could arise by mere chance less than 1 in 20, or 1 in 
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370 times, respectively. As 1/20 already represents 5 per cent., 
we consider such differences to be significant, that is, as pointing 
to a " real ” difference between the samples. 

The connection between the multiple of a^/VN and the 
probability level is derived from the table of the Proba^Oility 
Integral or the Error Function (Appendix I). 

(2) For small N. 

The principle of the test is the same as before, but we cannot 
use the Error Function in this case for the following reason. 

The quantity- ^^ - ^^ is distributed normally if the parent 

distribution is normal, and the Error Function applies there- 


fore for the connection of a given multiple of 


with a level 


of probability, In large samples it is permitted, as observed 
above, to regard the standard deviation in the sample, s, as a 
sufficiently close estimate of the standard deviation in the popula- 
tion oj, and we can therefore rely on the probabilities as given 

by the Error Funcrion for certain multiples of In small 

samples, however, the standard deviation differs from that of 
the population, and this the more the smaller the sample. 


The quantity t =- 


JL LJ.U if S I ^•'"*'0 

This means that certain multiples of 


is no longer distributed normally. 


are now connected 


with probability levels different from those we found to corre- 
spond to such multiples in the Error Function. 

For testing the difference of means in small samples, 
therefore, we must use tables of h different function, called 
“Student’s t function” (after Student, the pen-name of its 
discoverer), of which a condensed version is given below. 

The quantity t is obtained according to the formula 


t ~ Xi,- 


j NxN, (Ni + N, 


(Ni+ Nj) 


where are the deviations from x^, x^ respectively. 
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TABLE OF “ STUDENT’S r’ 

The values in the body of the table represent such 
multiples of the Standard Error on both sides of the Mean 
as are required for accommodating specified percentages of 
t, viz. 60 per cent., 90 per cent., 96 per cent., and 99 per 
cent, for a given number (n) of Degrees of Freedom. The 
percentages P, of t occurring beyond these values are 60 per 
cent., 10 per cent., 6 per cent., and 1 per cent, respectively, and 
the table is so arranged as to refer to there percentages. 

Table of “ Student’s i ” 







T 1 

\ P 

n 

•60 

•10 

•06 

•01 

1 

I'OUO 

6-:!4 

12-71 

03-66 

2 

•810 

2-92 

4-30 

9-62 

.s 

•766 

2-35 

3'18 

5-84 

i 

•741 

2-13 

2-78 

4-00 

0 

•727 

2-02 

2-57 

4-03 

6 

•718 

1-94 

2-46 

3’71 

7 

•711 

1-00 

2-36 

3-60 

8 

•700 

1^86 

2'31 

3-30 

9 

•703 

1-83 

2-26 

3-26 

10 

•700 

1-81 

2-23 

3-17 

11-16 

■697-091 

1 •80-1-75 

2-20-2-13 

3'll-2'95 

16-20 

• 600-087 

1-75-1-72 

2- 12-2-09 

2'92-2-84 

21-;!0 

■686-683 

1-72-1 -70 

2-08-2-04 

2’83-2-75 

31-100 

■082-677 

1-69-1-66 

2-03-1-98 

2'72-2-63 

101-1000 

■677-764 

1-66-1-65 

l-98-l'96 

2-G3-2-58 

00 

•674 

1-64 

1-96 

2-58 


The table is entered with « = N — 1 on the vertical scale if 
the samples are of the same size, and with « = Ni Nj — 2 if 
they are of different size. Our observed value of t is then 
compared at this level with the values in the body of the table, 
and the probability with which to expect a difference like ours 
on mere chance is then read on the horizontal scale, it lor a 
given K, the observed t is greater than the corresponding 
value in column P = ’05, this means that such a value of t 
has a probability of less than 6 per cent, in samples coming 
from one and the same lot, and we consider it to signify different 
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B — Significance of Difference Between Two Standard 

Deviations 


(1) For large samples * 
We form the quantity 


which is the standard deviation 

V2N 


in the distribution of standard deviations of samples of N, 
and proceeding as before— that is, according to whether the 
observed difference between two standard deviations exceeds 

two or three times the quantity we conclude, on the 

V2N 

basis of the Error Function, that such differences could occur 
less than 1 in 20 or 1 in 370 times. If so, we regard the 
difference as pointing to a real difference between the two 
samples ; otherwise, their variability is assumed to be essen- 
tially the same. As a sufficiently close estimate of Uj, we use 
again the observed standard deviation. 


(2) For small samples and large samples in non-normal distributions 
We form the ratio of the squares of the observed standard 
deviations -i_, and use the tables of the Ratio Variance 

Si 

(based upon the Beta-Function, Appendix X) for ascertaining 
how often such a ratio is likely to arise if both samples have 
come from the same ^population. As the fundamental level of 
probability seiwing as the border-line between “ apparent ” 
and “ real ” differences, we again take the 6 per cent, level 
(see Chapter IV, paragraph 4.3, 4.5). 


II 


C — Testing the Significance of Difference Between 
More than Two Means 


We form the quantity 


n (Variance between Sample Means) 
Average Variance witlnn Samples of n 


which is familiar from Chapter IV. 


In, symbols 


wVar] 
Varj ' 


We now test 


'Varj 


as if aV arj. was the variance of a second 


* Normality of Parent distribution assumed 
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sample differing in size from lliose from, which Var^ was 
obtained. That is, we use exactly the same kind of test as Was 
considered in B (2) where the standard deviations were com- 
pared for their own sake. 

The explanation for this procedure is given in Chapter IV, 
dealing with the analysis of variance method. 


D — SiGNIFIGANGE OF DIFFERENCE BETWEEN MORE THAN 
Two Standard Deviations 

Denoting the variances by Vj, Vg, Vg, V 4 . , we form 
the quantity 

Geometric Mean of Variances 
Arithnactic Mean of Variances 

in symbols 

T - « . yv. . V. ■ V3 ■ V4 ■ ■ . V„. 

Vj -|- Vj + V3 + V4 -b V„ 

One properly of L is that it decreases in value as the 
variances differ more among themselves. It has a maximum 
value of 1 if the variances are equal. The 5 per cent, prob- 
ability level of significance for various combinations of 
degrees of freedom, that is, for different numbers of variances 
(«) and different sample sizes (N) can be read from tables. 
If the value of L sinks below that level we conclude that the 
differences between the variances are real. 

(This test is due to E. S. Pearson and Neyman, 1931, and 
is in this foi’ni applicable if the number of specimens, N, is the 
same in all samples.) 

Ill 

E — Significance of Difference Between Frequencies 

If both mean and standard deviation are sensibly equal, 
the question of whether both samples have come from the same 
population still depends on whether the same type of Law of 
Chance governs the distribution in both samples and in the 
population. The criterion for this most important question 
Ues in the corresponding class frequencies. If these do not differ 
significantly for given values of the variable from one sample to 
the other, and between samples and population, and if mean 
and standard deviation are sensibly the same, then we can 
be satisfied that the samples have come from one population. 

. For testing the significance of differences between the 
corresponding class frequencies in two distributions whose mean 
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and standard deviation are sensibly the same, the x'^' "test 
is applied, of which the barest outline has been given in 
paragraph 3.5. 

IV 


F — Significance of Difference Between Fraction or 
Percentage Defectiveness 


(a) Comparing two sampling results 

If two samples of sizes and show m-y and m.^ defectives 
respectively, the difference of sample qualities, ^ is 

tested for significance b y the 

Standard Error = J P b ) + 
which for samples of equal si ze, Uy = — n, reduces to 

Standard Error 

If the difference _ £i, or for equal sample size^ - 

Uy n Ui 

exceeds twice the standard error, this is taken as denoting that 
the samples represent different populations. 


(fi) Comparing more than two sampling results 

We calculate the mean defectiveness q, for all the samples 
drawn from different lots, and form the standard deviation for 

a homogenous population as a' where p q — \ 


and n is the total number of specimens (see para. 6.5). We 
now calculate the standard deviation of the deviations, X{, of 
our sample qualities expressed as fractions from the mean 


defectiveness, q, according to the formula cr" 



The quantity Q. = ^ then affords a measure of the homo- 

<7 

geneity, or otherwise, of the population. The more it deviates 
from unity, the less likely is it diat the probability of a defective 
is the same for aU the lots from which the samples were drawn. 
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Metiiods of Condensing the Information Derivable from 
Control Charts 

(1) There are various methods for condensing the informa- 
tion contained in the quality control charts so as to give, on 
one hand, tire higher management a concise idea of the 
quality of the work over a certain period, and, on the other 
hand, to stimulate production in the shops by exhibiting the 
quality of the product. 

One of these methods consists in providing the higher 
management with a weekly summary containing all the 
necessary information about the week’s work as far as the 
operations are under quality control supervision. 

The management can read in those forms the extent to which 
the control chart technique was employed in a given production 
branch, the number of charts, number of persons employed, 
number of samples witliin and without control limits — and thus 
the state of statistical control — the number of samples within 
and without specifications — and thus the state of economic 
control. The quality control foreman, on the other hand, by 
summarizing ihe week’s inspection results in this way, is in 
a position to inform the shop people reliably on the quality 
of their product, and so to give them an idea to what extent 
their work comes up to specifications. 

(2) It is, however, feasible to condense still further the in- 
formation based upon, say, a week’s experience by using a Qiiality 
Index — that is, a single figure — vindicating the quality of the 
product.* Needless to say that a single figure like that can be 
arrived at only by sacrificing a great amount of useful inforrna- 
tion, but it has, on the other hand, the advantage of facilitating 
comparisons between different weeks. Whether the loss of 
information is outweighed by the increase in conciseness is a 
question which the higher management and the shop manage- 
ment must decide for themselves, and the preference for one 
or the other method will be very often a matter of taste. _ 

The principles underlying the construction of sych an index 
are these. From the specifications it is easy to arrive at control 
limits for 100 per cent, perfect product (see paragraph 2.6). 

* The conception of the duality Index is due to Mr. B. Taylor and Mr, 
Ri F. H. Banistei. The mathematical structure given here differs, however, 
considerably from the one used by Messrs. Taylor and Banister, 

, 243 
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It is Lhcn possible to draw control limits at such distances from 
the specification mean as would result from say, 5 per cent., 
10 per cent., 15 per cent, defective product. For this purpose 
we again make use of the properties of normal distributions 
to which, we can assume, our distributions of sample means 
will satisfactorily approximate. 

For 100 per cent, perfect product, that is, for product of 
which practically nothing exceeds specifications, the distance 
specification mean-tolerance (be it upper or lower) corresponds 
to 3 (t approximately, and the distance specification mean — 

1/1000 limit to or, denoting the standard deviation of the 
means by o', to 3^'. 

Know, 5 per cent., say, of the product exceeds specifications, 
the range, and, therefore, the standard deviation in both, 
parent distribution of individuals and sampling distribution of 
means, increases. Let us denote these new standard deviations by 
<ji and Cl' respectively. It is evident that the distance speci- 
fication mean-tolerance will now be less than 3ci, and the 
distance between the specification mean and former control 
limit less than 3ci'. What we are now interested in is what 
fraction of 3ci' corresponds to the control limit distance for 
perfect product. Knowing that fraction, we can easily obtain 
Cl', and from it the new control lines for 6 per cent, defective 
product. 

For obtaining the answer to our problem we consult the 
tables of the Probability Integral, Appendix I. According to 
our stipulation, 95 per cent, of the product is to lie within 
specifications. We now read from the tables that 96 per cent, 
will lie within ± 1'96 cti from the Mean, or 

Distance Spec. Mean-Tolerance = 1-96®!, 
and Distance Spec. Mean — l/lOOO 
limit for 100 per cent, perfect product = 

Vn 

Denoting that distance by D, and putting ax for -2X, we have 

Vn 

D = 

.-.oi' = -61 D. 

We then obtain our new control limits at M 4: 3ai', or 
M ± 1-63 D. _ 

In order to obtain control limits for other levels of defective- 
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ness, we proceed in a completely analogous manner (see below, 
where the technique is condensed into eight brief rules). 

If now the points within control limits are counted, and at 
the same time weighted according to whether they lie beyond 
the fi.>iist(5% defectiveness) but within the second pair (10%) 
of control limits, beyond the second but within the third (16%) 
pair, and so forth, we obtain a number of “marks” for each 
chart. This figure is then deducted from the total number of 
points plotted on the chart, and the resulting figure divided by 
the total number. The ratio is called the Quality Index, and 
it is clear that it will be the smaller the greater the number 
of outliers and the farther away from the mean they occur. It 
should, however, be kept in mind that if the quality index is, 
say, 95 per cent., this does not mean that the product is 95 per 
cent, perfect. The method of marking introduces a strange 
element, with the result that the index figure is just as its name 
implies — an index, but not an exact description of quality. 

Apart from forming the frame-work for the quality index, 
the pairs of control lines for different quality levels, if put into 
the margin of the chart, enable txs to read at a glance the 
approximate level of defectiveness of our product by comparing 
these frame lines witlt the actually obtained limits in the body 
of the chart which is, perhaps, more important than the index 
figure. 

(3) The technique for arriving at the control limits and the 
’quality index figure can be condensed into the following 8 rules : 

1. Obtain 1/1000 control limits for 100 per cent, perfect 

product from tolerances by dividing by V n {n for 

sample size). Half the distance between these limits (D), 
measured on eitlrer side of the specification mean, represents 
three times the standard deviation of sample averages, a', 
for perfect product. 

2. Let D = l-OGa-/ ; = -SI D. Then the specification 

mean M d: or M ± 1-53 D, gives the 1/1000 limits for 

averages of samples of n if the product is 6 per cent, defective 
(both sides taken together). 

3. LetD = LOScTa' Ua' = -606 D. Then the specification 
mean M i Sug', or M = 1'82 D, gives the 1/1000 limits for 
averages of samples of n if the product is 10 per cent, defective. 

4. LetD = l-44CTg' ; 03' = -6940. Then the specification 
mean MdhSffa', or M =2-08 D, gives the 1/1000 limits for 
averages of samples of n if the product is 16 per cent, defective. 
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6. From tire control limits based upon a' , Og', 03') it 
is possible to obtain the Average Range {w„) by dividing the 
quantities 3(r' Sug', SjTj') by (see Appendix IV). 

The control limits for the range are then obtained by multi- 
plying Wn by the factors listed in Appendix IV. 

6. The specification mean, together with the four pairs of 
1/1000 limits for 0 per cent., 5 per cent., 10 per cent., 16 per 
cent, defective product obtained according to steps 1-4= for 
the chart of averages, and step 6 for the chart of ranges, are 
put into the margin of the respective chart which otherwise 
uses the control limits calculated from actual samples and 
recalculated from time to time. 

At the end of the week (or the chart), the margin limits 
are prolonged in pencil over the whole length of the chart, 
and the points within and without limits are counted. 

7. Treat the average and range chart for a given char- 
acteristic as one chart for the purpose of the quality index. 
Sum the points within the 3cr' — Scr^', 3ai' — Sag', 3crg'— Sag' 
bands on both charts, counting a sample which appears in 
a given band on both charts as one, and deduct for samples 
within 

3 ct' to 3(7 i' 1 point or 1 mark 

So-i' to ScTg' 2 points or 2 marks 

ScTg' to Sctj' or further . . 3 points or 3 marks 

Deduct from the total number of points the sum of all 
“ marks,” and divide by the total number of plotted points, 
that is, the number of sample averages. The quotient multiplied 
by 100 serves as tire quality index. 

8. For number defective charts, set a target at an accept- 
able limit of defectiveness [d). For a 6 per cent, increase in 
defectiveness find the 1/1000 limit for [d + 5) per cent, mean 
defectiveness (according to the sample size), and then proceed 
as with measurement charts (stages 6 and 7). Similarly for a 
10 per cent, and 16 per cent, increase. 
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Acceptance, probability of, 107, 198 

Accidents, frequency distribution of, 
139, 188 ; defectives as, 141 ; 

control lines for distribution of, 142 ; 
differing liability to, 188 

Action on basis of control chart, 7, 
26-31, 45, 64 

Additive property of variances, 91 

Ammunition, illustrative examples for 
controlling quality of, 26 sqq. ; and for 
inspection of, 178 

Analysis of variance, principle of, 
78 sqq. ; parallelism with quality 
control chart technique, 76 ; its 
use in testing significance of dif- 
ferences between means, 240, be- 
tween standard deviations, 240, be- 
tween percentages, 242 ; its relation 
lo trends, 105 ; simplified methods 
of computation, 86, 89 ; multiple 
variate analysis, 99 

Arithmetic mean, calculation of, 218 ; 
from arbitrary value, 219 ; standard 
error of, 21 

Association, between two variables, 
105 ; measure of, 119 ; see also 
Correlation 

Asymmetrical or skew distributions, 
69 sqq. 

Attributes, see Sampling of attributes 

Averages, of samples, 18 setq. ; grand 
average, 22 ; distribution of stand- 
ard deviation of, 21 ; control lines 
for, 22 ; factors for calculating con- 
trol lines, 221 ; of range, 22 ; of 
standard deviation, 22 ; their im- 
portance for industrial control in- 
spection, 43 ; see also under Mean 

Bernoullian law, 173 

“ Best fit ” of regression lines, 118, 136 

Beta-function (also jG-function), charts 
for incomplete (S-ianction ratio, 
226 sqq. ; tables, 228 sqq. ; instruc- 
tions for using charts, 178 sqq. 


Binomial distribution, 175 ; standard 
deviation of, 146 ; with negative 
index, 189» ; probability graphs of, 
187, 190 

Bivariate distributions, 109 ; intensity 
of dependence. 111 ; diagrammatic 
representation of, 114 
Borderlines between stability and dis- 
turbance, 16 

Bortkiewicz, L. von, data of death from 
kicks by a horse as Poisson distribu- 
tion, 139 

Brass discs, distribution of thickness of, 
60 ; control chart for, 39 
Brass shells, distribution of diameter, 
70 

Bravais, formula for correlation, co- 
efficient, 115 ; working formula, 
115 ; illustration of working with 
Bravais's formula, 117 
Bulk V. sample quality, 171 sqq., 
193 ; in the case of probability 
graphs, 63 sqq. 

Carap-Meidell inequality, 68 
Cartridges, assembling of, examples of 
control charts, 26 sqq. 

Causes, assignable v. chance, 26 ; 
probability of, 172 ; and effects, 
not in one-to-one correspondence, 
173, 177 

Cells in X“"test, of not less than five 
items, 56; comparison of frequencies, 
241 ; see also under Chi-square 
Chance, as complex causation, 4 ; 
of success or failure of an event, 
175 ; laws of, 2 sqq. 

Chances, small, as generating Poisson 
distribution, 140 
Charts, see Control charts 
Chi-square, S3 J its relation to Poisson 
summation function, 67 
Class interval, choice of magnitude of, 
62 ; its influence on. magnitude of 
standard deviation, 63 
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Coefficient ol’ covrelation, 11 5, 122; of 
alienation, 120 
Compass of chance, 127 
Compound Poisson distribution, 187 sqq. 
Condensing of information, dB, 243 
Constraints in X”-d*stribution, 56 
Consumer’s risk, 194 
Control lines (charts), statistical, prin- 
ciple of, 44 ; in gunnery -work, 6 ; 
in production, 7 ; statistical and 
economical, 16 ; for averages, 16 ; 
for ranges, 36 ; elliptical, 129n ; 
inclined, 108 ; non-linear, 130 ; for 
number defectiveness, 142; modi- 
fied, 30 ; for averages of number 
defectiveness, 162 sqq. ; factors for 
calculating, 221 ; for lot quality, 182 
Correction, of sample standard devia- 
tion for size of samples, 220 ; of 
standard deviation for grouping, 63 
Correctness t/. definiteness in sampling 
inspection, 171 

Correlation, generally, 127 sqq. ; re- 
lation to analysis of variance, 105 ; 
relation to quality control chart, 
121 ; correlation tables, 110, 116 ; 
represented by scatter diagram, 
112 ; product-moment coefficient of, 
116 sqq., 122 ; linear, 118 ; curvi- 
linear, 130 ; test for linearity of, 
123 ; geometrical method for esti- 
mating coefBcient of, 114 ; ratio of, 
119 ; multiple, 134 
Corrosion, experiment on, illustrating 
multiple correlation, 134 sqq. ; illus- 
trating multi-variate analysis of vari- 
ance, 99 

Cruber, E., 162n 

Da ta, condensing of, 46, 243 
Davies, O. L., and E. S. Pearson, 220n 
Defectives as accidents, 141 sqq. ; 
binomial distribution of, 146 ; num- 
ber and fraction defective charts, 142 
Definiteness a. correctness in drawing 
conclusions ftom sample quality, 171 
Degrees of freedom, in. X“-test, 66 ; 

in analysis of variance, 85, 89 
Deming, W. E., quotation from, 193 
Dependence between variables. 111 
Description, shorthand method of, 213 
Deviation, root - mean - square, 218; 
standardizing deviations, 114 ; see 
also under Standard deviation 


Diameter of paper discs, distribution of, 
84 sqq. ; of brass shells, 70 
Dispersion, measures of, 217 ; range 
as a measure of, 22, 36 
Distribution of frequency of observa- 
tions, 3 ; for measuremenfi, 13 ; 
for number or fraction defectiveness, 
142 ; sampling distributions, 10 
Dodge-Romig, vii 
Double sampling technique, 206 
Dudding, B. P., vii 

Economic v. statistical control, 16, 
147 

Elimination, method of successive, 99, 
134 

Error function, as integral of normal 
curve, 4 ; tables for, 217 
Error, law of, 3 ; frequency of, as 
function of deviation from mean, 4 ; 
mean square or variance, 85 
Error, standard, definition, 21, 146 ; 
of means in normal population, 20, 
237 ; of number or proportion of 
defectives, 146 ; of standard devi- 
ation, 240 ; of e.slimate, 124 
Experimental design, 99 

Factors, for calculation of control 
limits, 221 ; for obtaining the 
standard deviation in the popula- * 
tion, 220 

Fechner, G. Th., 72n 
Fisher’s .c- table, 83 

Fitting of curves, to data, on prob- 
ability graph paper, 60 ; goodness of 
fit, tested by X "“method, 64 sqq. 

Form of rounds of ammunition, dis- 
tribution of, as example for erratic 
variations, 165 ; for compound 
accident distribution, 187 
Freedom, degfees of, see Degrees of 
freedom 

Frequency, distribution, 3 ; forming of, 
62 ; graphic representation of, 4, 
46 ; symmetrical and asymmetrical, 
60; following the Normal law, 13 ; 
J-shaped, 140 ; binomial, 175 ; 
Poisson, 139 

Frequency polygon, see Histogram 

Galton, Sir F., method of correlation, 
72b, 111 
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Gauging, ld3 ; made more severe than 
according to specifications, 167 
Gaussian law, see under Normal law 
Geometric representation of frequency 
distributions, 4 ; of cumulative^is- 
trib^tions, 40 ; of variations be- 
tween sample means, 13 sqq. ; be- 
tween sample ranges, 36 ; between 
means and standard deviations on 
probability graphs, 67 ; test for 
normality of distribution, 47 ; method 
for obtaining coefficient of correla- 
tion, 111 

Graphs, see Probability graphs 
Greatest and least values of samples, 
see Range 

Greenwood, M,, 188n 
Grouping of observations, 62 ; effect 
upon standard deviation, 63 ; im- 
portance of, according to suspected 
assignable causes, 43 

Hardness, chart for controlling, 130 sqq. 
Hartley, H, O., and Pearson, E. S,, 220« 
Hazen, A., 47n 

Height of filling, distribution of, 109 ; 

correlation with weight. 110 
Histogram, 4 

Human bias in sampling, 204 

Incidence of sampling, 44 
„ Incomplete Beta-function, 178 ; charts, 
225 sqq. ; tables, 228 sqq. 

Index, quality, 243 

Industrial control inspection, 17 ; 
sampling methods, 171 ; fallacies, 
172 

Inequality, TschebychefF’s, 06 ; Canip- 
Meidcii’s, 67 

Interval, between samples, see Inci- 
dence 

Irwin, J. O., I63n 

Inspection, sampling, 193; faulty 
methods of, 171 ; rational principles 
of, 172 ; Ig charts for, 225 sqq. 

jennett, W. J., vii 

Kapteyn, J. C., Tin 
Kendall, M. G., 105n 
Kick by a horse, deaths from, following 
Poisson law, 139 

Labour-saving methods for obtaining 
control limits, 144 ; for analysis ol 
i>ariance calculations, 86, 89 


Lack of control, as shown by quality 
control charts, 27 sqq. 

Least square.s, standard deviation as 
sum of, 211) 

Leptokurtic curves, 68 
Levels of significance of differences, 
80 sqq. ; tables for variances, 236 ; 
for means, 217, 239 
Levy, H., and E. E. Preidel, on, \iln 
Limits of confidence, on quality control 
charts, 16 ; calculation of, 22 ; 
meaning of 1/40, 1/1000 limits, 14 ; 
set in from tolerances, 30 ; for 
qualitative charts, 142 ; for sample 
averages on qualitative charts, 1 62 ; 
see also under Control lines 
Linearity of Regression, 118 ; test for, 
123 

Lines of control, see Control lines 
Loading cartridges with propellant, 
illustration for linear correlation, 
109 sqq. 

Logarithmic frequency distributions, 
72 ; plotting on logarithmic prob- 
ability paper, 72 

Lot V. sample quality, 171 sqq. ; charts 
“ of function connecting both, 226 sqq. 
Lot tolerance defectiveness, 194 

McAlister, 72n 
Mathematical tools, 40 
Mean, arithmetic, its properties, 218 ; 
calculation of, for grouped distri- 
butions, 117 ; of Poisson distribu- 
tion, 143 ; standard error of, 21 ; 
significance of difference between 
sample means, 237, 238 ; between 
more than two means, 240 
Mean-square root, see Standard devi- 
ation 

Measurements, distribution of, 5 
Meidell-Camp inequality, 67 
Metal rods, length of, example for 
Poisson distribution, 186 
Method, statistical development of, 9 
Modified control chart technique, 30 
Multifactorial design, 99 
Multiple correlation, 134 

Neyman, T., and Pearson, E. S., 241 
Normal law or distribution, 2 ; as a law 
of chance, 4 ; its use in ballistics, 7 ; 
in the control of manufactured pro- 
duct, 7 ; tendency of averages to be 
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normally clistribuLed, 17 ; integral 
of, 4 ; table of error function, 217 ; 
sampling distribution for averages 
from, 21 ; for standard deviations 
from, 240 ; for the range in samples 
from, 222 

Numerical examples for calculating 
control limits, 22 ; of calculating 
correlation co-efficient, 117 

Ogive curve, 4S ; construction of, 50 ; 
transformation into straight line, 00 ; 
logarithmic ogive, 72 ; transforma- 
tion into straight line, 73 

Pearson, E. S., and Davies, O. L., 220« 
Pearson, E. ,S., and Plartley, PI. O,, 
222n 

Pearson, E. S,, and Ncyman, T., 241 
Percentage (or proportion) within and 
outside of specifications, 65, 74 ; 
estimation of, by means of inequality, 
Tschebycheff ’s, 86; Camp-Meidcll’s, 
67 

Platykurtic curves, 09 
Poisson distribution, mean and stan- 
dard deviation, 143 ; of sample 
means, 163 ; y'.v distribution, 153 ; 
probability graphs, lfi7, 190 ; com- 
pound distribution, 187 ; summa- 
tion charts, 223, 224 
Population as different IVoin samples, 
1,213 

Precision, of measurements and group- 
ing, 62 ; h as measure of, 3 
Probability, law.s of, 2, 140, 176 ; of 
causes, 177 sqq, ; graph paper, 
40 sqq. ; of fire, 7 ; of good pro- 
duction, 7 ; confidence limits, 14 ; 
of deviation from mean, 13 ; of 
differences between sample means, 
19 ; levels of significance, 217, 221, 
223, 224, 235 , 2;j9 ; integral, 4, 217 
Probable error, 7 
Process average defectiveness, 197 
Producer’s risk, 194 

Quality control chart technique, prin- 
ciple of, 44 ; in conjunction with 
specifications, 24 ; modifications of 
39 sqq. ; based upon binomial or 
Poisson law, 142, 140 ; qualitative 
charts for averages, 162 ; if no 
defectives arc to be tolerated, !67 


Quality, lot as judged from .sample, 
171 sqq. ; improving lot quality 
by process control, 7-9 ; improving 
lot quality by rational inspection 
scheme, 200 ; relation between lot 
and sample, according tqo Beta- 
function, 17B sqq. ; index of 243 

Random sampling, technique, 204 ; 
versus selection, 44 

Range, as a measure of dispersion, 22, 
221 ; factors for transforming into 
standard deviation, 220 ; control, 
charts of 36 ; factors for obtaining 
control limits, 222 ; maximum range 
for specified number of measure- 
ments, 38 

Reducing defectiveness by sampling 
inspection, 200 

Regression, equation of straight line, 
118, 136 ; curvilinear, 130 ; test of 
linearity, 123 ; control limits, 124, 
133; as compared with control lines 
of quality control chart, 126 
Rejection limits, 196 
Residuals, in analysis of variance, 86 ; 

in correlation analysis, 119, 122 
Romig-Dodge, vii, 194 sqq. 

Sampling of attributes, qualitative, 
141 ; standard deviation of number^ 
and proportion of given attribute, 
140 ; tests of significance, 242 
Sampling distribution (vafiable,s) , of 
means, 21 ; of standard deviation 
(range), 222, 240 ; for .small samples, 
238, 239 

Sampling inspection, 193 sqq. ; ra- 
tional scheme, 199 ; consumer’s 
risk in, 194 ; producer’s ri,sk in, 196, 
196 ; tables for simple and double, 
199, 231-234 ; limiting outgoing 
defectiveness by, 199, 231-234 ; illus- 
trations of efficiency of 200, 204, 209 ; 
rectifying e. acceptance -rejection 
method, 200 

Sampling methods, random, 204 ; 
non-random, for quality control 
purposes, 43, 44 
Scatter diagram, 112, 136, 138 
Setting variations, 29, 97 
Shell fuses, inspection of 178 
Sheppard’s correction for grouping, 63 
Shewhart, W. A,, vii 



INDEX 


Shots, scatter of, in artillery practice, 7 
■ Shots at target, distribution of, 3 

Signihcancc, tests of, 237 sqq. 

Simon, h. E., vii, 173 

Skew distributions, fit) sqq. 

Small samples, attiibutes, 173 ; vari- 
able% effect on sampling distri- 
bution, 23, 221 ; t-distiibution, 

239 

Standard 'dtviation, as root -mean 
square deviation, 13, 219 ; calcula- 
tion, 22, 219 ; of arbitrary origin, 
219 ; for grouped data, 1 17 ; multi- 
ples of, and frequency of observa- 
tions, 14 ; of a series compounded 
of others, 93 ; of arrays in bivariate 
distribution, 120 ; of binomial dis- 
tribution, 146 ; of Poisson distri- 
bution, 143 ; of distribution of 
means, 21, 153 ; factors for irans- 
foiflllng standard deviation of sample 
.into standard deviation of popula- 
tion, 220 ; constant for 1 /^ distribu- 
tion, 163 

Slandard evrov, see Error 

Statistics, control of quality by, 12 ; 
advantages, 15 ; relation between 
main branches of, 0 ; as a basis for 
action, 28 sqq., 4,6, 04 ; of inspection 
procedures, 193 sqq. 

Steel, protection of, against corrosion, 
90, 134 

Straight line, iitletl to data, 1 LH, 13,6 ; 
transformation of cumulative curve 
into, 47 ; advantage of doing this, 
74 ; test of good fit, 54 
Student ” (pseudonym), standard 
error of the mean, 238 ; comparison 
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of small samples on ba.sis of 1-fuiictioii, 
23!) 

<-distribution, see “ .Student ” 

Target, working to, 119 
Tippett, L. H. C., 20.6 
Tolerances, in specifications, 17 ; are 
not control limits for average, s, 18 ; 
control limits set in Irnm, 33 
I’ool wear, indicated by control chart, 
30 ; by probability graph, (14 
Trend, 35, 10.6 
Tschcbycheff’s inequality, 68 
Type in biological statistics, 10 

Universe, alternative term lor popula- 
tion or lot, 10 ; estimating standard 
deviation of, 220 

Variability, of observations, 3, 9 ; 

clue to complexity of causation, 5 
Variance, term for squared standard 
deviation, 78 ; correlation coefficient 
as part of, 119 

Velocity of rounds of ammunition, 
control cliarL for, 107 

Wear of gun barrel, 107 
Wciglu of propellant correlated with 
height of filling, i 09 
Weights, of component of mixture, 
tlieir distribution, 61 
Woi-king formula for calculating corre- 
lation coefficient, 116 
Working mean in modified control 
chart technique, 3 1 

Yule, G. Udny, 105«, 18S«, 19Dn 
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